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NOVEL VIP3 TOXINS AND METHODS OF USE 



HELD OF TBDE INVENTION 



[0001] The present invention relates to novel Vip3 toxins from Bacillus thuringiensis, 
nucleic acid sequences whose expression results in said toxins, and methods of making 
and methods of using the toxins and corresponding nucleic acid sequences to control 
insects. 



[0002] Plant pests are a major factor in the loss of the world's important agricultural 
crops. About $8 billion are lost every year in the U.S. alone due to infestations of non- 
mammalian pests including insects. In addition to losses in field crops, insect pests are 
also a burden to vegetable and finit growers, to producers of ornamental flowers, and to 
home gardeners. 

[0003] Insect pests are mainly controlled by intensive applications of chemical pesticides, 
which are active through inhibition of insect growth, prevention of insect feeding or 
reproduction, or cause death. Good msect control can thus be reached, but these 
chemicals qm sometimes also affect other, beneficial msects. Another problem resulting 
from the wide use of chemical pesticides is the appearance of resistant insect varieties. 
This has beeiji partially alleviated by various resistance management practices, but there is 
an increasing; need for ^temative pest control agents. Biological pest control agents, such 
as Bacillus ihuringiensis strains expressing pesticidal toxins like 5-endotoxins, have also 
been applied to crop plants with satisfactory results, offering an alternative or compliment 
to chemical pesticides. The genes codmg for some of these 5-endotoxins have been 
isolated and their expression in heterologous hosts have been shown to provide another 
tool for the control of economically important insect pests. In particular, the e;q)ression of 
insecticidal toxins in transgenic plants, such as Bacillus ihuringiensis 5-endotoxins, has 
provided efficient protection against selected insect pests, and transgenic plants 



BACKGROUND OF THE INVENTION 
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expressing such toxins have been conunercialized, allowing fanners to reduce 
applications of chemical insect control agents. 

[0004] Other, non-endotoxin genes and the protems they encode have now been 

identified. Patents 5,877,012, 6,107,279, 6,137,033, and 6,291,156, as well as Estrucher 
al. (1996, Proc. Natl. Acad. Sd. 93:5389-5394) and Yu et al (1997, Appl. Environ. 
Microbiol. 63:532-536), herein mcorporated by reference, describe a new class of 
insecticidal proteins called Vip3. Vipi genes encode approximately 88 kDa proteins that 
are produced and secreted hy Bacillus during its vegetative stages of growth (vegetative 
insecticidal proteins, VP). The Vip3 A protein possesses insecticidal activity against a • 
wide spectrum of lepidopteran pests, including, but not limited to, black cutworm (BCW, 
Agrotis ipsilon\ fall armyworm (FAW, Spodoptera Jrtigiperda), tobacco budworm 
(TBW, Heliothis virescens), and com earworm (CEW, Helicoverpa zed). More recently,' 
plants expressing the Vip3 A protein have been found to be resistant to feeding damage 
caused by hemipteran insect pests. Thus, the Vip3 A protein displays a unique spectrum of 
insecticidal activities. Other disclosures, WO 98/18932, WO 98/33991, WO 98/00546, 
and WO 99/57282, have also now identified homologues of the Vip3 class of proteins. 

[0005] The continued use of chemical and biological agents to control insect pests 
heightens the chance for insects to develop resistance to such control measures. Also, 
only a few specific insect pests are controllable with each control agent. 

[0006] Therefore, there remains a need to discover new and effective pest control agents 
that provide an economic benefit to farmers and that are environmentally acceptable. 
Particularly needed are control agents that are targeted to a wide spectrum of 
economically important insect pests, to control agents that efficiently control insect 
strains that are or could become resistant to existing insect control agents, and those with 
increased potency compared to current control agents. Furthermore, agents whose 
application minimizes the burden on the environment are desirable. 

SUMMARY 

[0007] The present invention addresses the need for novel pest control agents by 
providing new genes and toxins that are distinct firom those disclosed in U.S. Patents 
5,877,012, 6,107,279, and 6,137,033, and Estruch et al (1996), and Yu et al. (1997), as 
well as WO 98/18932, WO 99/33991, WO 99/5782, and WO 98/00546. 
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[0008] Within the present invention, compositions and methods for controlling plant 
pests are provided. In particular, novel vip3 nucleic acid sequences isolated torn Bacillus 
thuringiefisis, and sequences substantially identical thereto, whose expression resuks m 
pesticidal toxins with toxicity to economically important msect pests, particularly insect 
pests that infest plants, are provided. The invention is further drawn to the novel 
pesticidal toxms resulting from the expression of the nucleic acid sequences, and to 
compositions and formulations contaming the pesticidal tomis, which are capable of 
inhibiting the ability of insect pests to survive, grow and reproduce, or of limiting insect- 
related damage or loss to crop plants. The invention is also drawn to methods of using the 
nucleic acid sequences, for example in making hybrid toxins with enhanced pesticidal 
. activity or in a recombinogenic procedure such as DNA shuflQing. The invention is 
further drawn to a method of making thetoxins and to methods of using the nucleic acid 
sequences, for example in microorganisms to control insects or in transgenic plants to 
confer protection from insect damage, and to a method of using the pesticidal toxins, and 
compositions and formulations comprising the pesticidal toxins, for example applying the 

. pesticidal toxins or compositions or formulations to insect-mfested areas, or to 

prophylactically treat insect-susceptible areas or plants to confer protection against the 
insect pests. 

[0009] The nucleotide sequences of the present invention can be engineered using 

methods generally known in the art in order to alter the nucleotide sequences for a variety 
of purposes including, but not limited to, broadening the spectrum of pesticidal activity, 
or increasing the specific activity against a specific pest. DNA shuffling by random 
fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may 
be used to engineer the nucleotide sequences. 

[0010] The novel pesticidal toxins described herein are highly active against insects. For 
example, a number of economically important insect pests, such as the lepidopterans 
Ostrinia mbilalis (European com borer), Plutella xylostella (diamondback moth), 
Spodopierajhigiperda (fall armyworm), Agrotis ipsilon (black cutworm), Helicoverpa 
zea (com earworm), Heliothis virescem (tobacco budworm), Spodoptera exigiia (beet 
armyworm), Diatraea grmdiosella (southwestem com borer), Diatraea saccharalis 
(sugarcane borer), Helicoverpa punctigera (native budworm) and Helicoverpa amtigera 
(cotton boUworm) can be controlled by the pesticidal toxins. The pesticidal toxins can be 
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used singly or in combination with other insect control strategies to confer maximal pest 
control efiSciency with minimal environmental impact. 
[0011] According to one aspect, the present invention provides an isolated nucleic acid 
molecule comprising a nucleotide sequence that encodes a toxin that is active against 
insects, wherein the nucleotide sequence: (a) has a compliment that hybridizes to 
nucleotides 1981-2367 of SEQ ID NO: 1 in 7% sodium dodecyl sulfate (SDS), 0.5 M 
NaP04, 1 mM HDTA at 50*^ C. with washing in 0. IXSSC, 0. 1% SDS at 65**C.; or (b) is 
isocoding with the nucleotide sequence of (a); or (c) comprises a consecutive 20 base pair 
nucleotide portion identical in sequence to a consecutive 20 base pair nucleotide portion 
of a nucleotide sequence of (a) or (b); or (d) has at least 93% sequence identity with SEQ 
ID NO: 1; or (e) encodes an amino acid sequence havmg at least 91% sequence identity 
■ with SEQ ID NO: 2. 

• [0012] In one embodiment of this aspect, the isolated nucleic acid molecule comprises a 
nucleotide sequence that has a compliment that hybridizes to nucleotides 1981-2367 of 
SEQ ID NO: 1 in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 50°C. 
with washing in O.IXSSC, 0.1% SDS at 65°C. 
[0013] In another embodiment of this aspect, the isolated nucleic acid molecule 
comprises a nucleotide sequence that is isocoding with a nucleotide sequence having a 
compliment that hybridizes to nucleotides 1981-2367 of SEQ ID NO: 1 in 7% sodium 
dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 50°C. with washing in 0. IXSSC, 
0.1%SDSat65°C. 

[0014] In yet another embodiment, the isolated nucleic acid molecule comprises a 

consecutive 20 base pair nucleotide portion identical in sequence to a consecutive 20 base 
pair nucleotide portion of nucleotides 1981-2367 of the nucleotide sequences set forth m 
SEQ ID NO: 1 or SEQ ID NO: 3. 

[0015] In another embodiment, the isolated nucleic acid molecule comprises a nucleotide 
sequence that has at least 75% sequence identity with nucleotides 1981-2367 of SEQ ID 
NO: 1. Preferably, the isolated nucleic acid molecule comprises a nucleotide sequence 
that has at least 85% sequence identity v^th nucleotides 1981-2367 of SEQ ID NO: 1. 
More preferably, the isolated nucleic acid molecule comprises a nucleotide sequence that 
has at least 95% sequence identity with nucleotides 1981-2367 of SEQ ID NO: 1. Even 
more preferably, the isolated nucleic acid molecule comprises a nucleotide sequence that 
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has at least 99% sequence identity with nucleotides 1981-2367 of SEQ ID NO: 1. Most 
preferably, the isolated nucleic acid molecule comprises nucleotides 1981-2367 of SEQ 
roNO: lorSEQIDNO: 3. 

[0016] In another embodunent, the isolated nucleic acid molecule comprises a nucleotide 
sequence that has at least 93% sequence identity with SEQ ID NO: 1. Preferably, the 
isolated nucleic acid molecule comprises a nucleotide sequence that has at least 95% 
sequence identity with SEQ ID NO: 1. More preferably, the isolated nucleic acid 
molecule comprises a nucleotide sequence that has at least 99% sequence identity with 
SEQ ID NO: 1. Most preferably, the isolated nucleic acid molecule comprises nucleotides 
1-2367 of a nucleotide sequence selected from the group consisting of SEQ ID NO: 1, 
SEQ ID NO: 3, SEQ ID NO: 10, SEQ ID NO: 3 1, and SEQ ID NO: 33. 

[0017] In one embodiment of the present invention, the isolated nucleic acid molecule 
encodes a toxin comprising an amino acid sequence with at least 75% identity with amino 
acids 661-788 of the amino acid sequence of SEQ ID NO: 2. Preferably, the isolated 
nucleic acid molecule encodes a toxin comprising an amino acid sequence which has at 
least 85% identity with amino acids 661-788 of the amino acid sequence of SEQ ID NO: 
2. More preferably, the isolated nucleic acid molecule encodes a toxin comprising an 
amino acid sequence which has at least 95% identity with amino acids 661-788 of the 
amino acid sequence of SEQ ID NO: 2. Even more preferably, the isolated nucleic acid 
molecule encodes a toxin comprising an amino acid sequence which has at least 99% 
identity with amino acids 661-788 of the amino acid sequence of SEQ ID NO: 2. Most 
preferably, the isolated nucleic acid molecule encodes a toxin comprising amino acids 
661-788 of SEQ ID NO: 2. 

[0018] In another embodiment, the isolated nucleic acid molecule encodes a toxin 
comprising an amino acid sequence which has at least 91% identity to the amino acid 
sequence set forth in SEQ ID NO: 2. Preferably, the isolated nucleic acid molecule 
encodes a toxin comprising an amino acid sequence which has at least 95% identity to the 
amino acid sequence set forth in SEQ ID NO: 2, More preferably, the isolated nucleic 
acid molecule encodes a toxin comprising an amino acid sequence which has at least 99% 
identity to tiie amino acid sequence set forth in SEQ ID NO: 2. Most preferably, the 
isolated nucleic acid molecule encodes a toxin comprising the amino acid sequence set 
forth in SEQ ID NO: 2 or SEQ ID NO: 12. 
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[0019] In one embodiment, the isolated nucleic acid molecule is comprised in a Bacillus 
thuringiemis isolate selected from the group consisting of C1674, designated NRRL 
accession B-30556; and C536, designated NRRL accession B-30557. 

[0020] In another embodiment, the isolated nucleic acid molecule comprises the 
approximately 2.4 kb DNA fragment comprised in an £1 coli clone selected from the 
group consisting of pNOV3910, designated NRRL accession B-30553; pNOV391 1, 
designated l^IRRL accession B-30552; pNOV3906, designated NRRL accession B-30555; 
pNOV3905, designated NRRL accession B-30554; and pNOV39I2, designated NRRL 
accession B-30551. 

[0021] According to one embodiment of the invention, the isolated nucleic acid molecule 
encodes a toxin that is active against a lepidopteran insect. Preferably, according to this 
embodiment, the toxin has activity against Ostrinia mbilalis (European com borer), 
Plutella xylostella (diamondback moth), Spodoptera frugiperda (fall armyworm), Agrotis 
ipsilon (black cutworm), Helicoverpa zea (com earworm), Heliothis virescens (tobacco 
budworm), Spodoptera exigua (beet armyworm), Pectinophora gossypiella (pmk boll 
worm), Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). 

[0022] The present invention also provides a chimeric gene comprising a heterologous 
promoter sequence operatively linked to the nucleic acid molecule of the invention. 
Further, the present invention provides a recombinant vector oomprismg such a chimeric 
gene. Still further, the present invention provides a transgenic host cell comprising such a 
chimeric gene. A transgenic host cell according to this aspect of the invention may be an 
animal cell, an animal virus, a plant virus, a bacterial cell, a yeast cell or a plant cell, 
preferably, a plant cell. Even further, the present invention provides a transgenic plant 
comprising such a plant cell. A transgenic plant according to this aspect of the invention 
may be sorghum, wheat, sunflower, tomato, cole crops, cotton, rice, soybean, sugar beet, 
sugarcane, tobacco, barley, oilseed rape or maize, preferably maize and cotton. Still 
further, the present invention provides seed from the group of transgenic plants consisting 
of sorghum, wheat, sunflower, tomato, cole crops, cotton, rice, soybean, sugar beet, 
sugarcane, tobacco, barley, oilseed rq)e and maize. In a particularly preferred 
embodiment, the seed is from a transgenic maize plant or transgenic cotton plant. 
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[0023] Also provided by the invention are transgenic plants of the invention fbrther 
comprising a second nucleic acid sequence or groups of nucleic acid sequences that 
encode a second pesticidal principle. Particularly preferred second nucleic acid sequences 
are those that encode a 5-endotoxin, those that encode another Vegetative Insecticidal 
Protein toxin or those that encode a pathway for the production of a non-proteinaceous 
pesticidal principle. 

[0024] In yet another aspect, the present invention provides toxins produced by the 
expression of the nucleic acid molecules of the present invention, 

[0025] In a preferred embodiment, the toxin is produced by the expression of the nucleic 
acid molecule comprising nucleotides 1-2367 of a nucleotide sequence selected from the 
group consisting of SEQ ID NO: 1, SEQ ID NO: 3, and SEQ ID NO: 10. 

[0026] In another embodiment, the toxins of the invention are active against lepidopteran 
insects, preferably against Ostrmia nubilalis (European com borer), Plutella xylostella 
(diamondback moth), Spodopiera frugiperda (fall armyworm), Agrotis ipsilon (black 
cutworm), Helicoverpa zea (com earworai), Heliothis virescens (tobacco budworm), 
Spodoptera exigua (beet amiyworm), Pectinophora gossypiella (pink boll worm), 
Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). 

[0027] In one embodiment, the toxins of the present invention are produced by a Bacillus 
thuringiensis isolate selected from the group consisting of CI 674, designated NKRL 
accession B-30556; and C536, designated NRRL accession B-30557. 

[0028] In another embodiment, the toxins are produced by an E. coli clone selected from 
the group consisting of pNOV3910, designated NRRL accession B-305S3; pNOV3911, 
designated NRRL accession B-30552; pNOV3906, designated NRRL accession B-30555; 
pNOV3905, designated NRRL accession B-30554; and pNOV3912, designated NRRL 
accession B-30S51 . 

[0029] In one embodiment, a toxin of the present invention comprises an amino acid 
sequence which has at least 75% identity with ammo acids 661-788 of the amino acid 
sequence of SEQ ID NO: 2. Preferably, the toxin comprises an amino acid sequence 
which has at least 85% identity with ammo acids 661-788 of the amino acid sequence of 
SEQ ID NO: 2. More preferably, the toxin comprises an amino acid sequence which has 
at least 95% identity v^th amino acids 661-788 of the amino acid sequence of SEQ ID 
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NO: 2. Even more preferably, the toxin conoprises an amino acid sequence that has at 
least 99% identity with amino acids 661-788 of the amino acid sequence of SEQ ID NO: 
2. Most preferably, the toxin comprises ammo acids 661-788 of SEQ ID NO: 2. 

[0030] In another embodiment, a toxin of the present invention comprises an amino acid 
sequence which has at least 91% identity with the amino acid sequence set forth in SEQ 
ID NO: 2. Preferably, the toxin comprises an ammo acid sequence which has at least 95% 
identity with the amino acid sequence set forth m SEQ ID NO: 2. More preferably, the 
toxin comprises an amino acid sequence which has at least 99% identity with the amino 
acid sequence set forth in SEQ ID NO: 2. Most preferably, the toxin comprises the amino 
acid sequence set forth in SEQ ID NO: 2, SEQ ID NO: 1 1, or SEQ ID NO: 32y 

[0031] The present invention also provides a composition comprising an effective insect- 
controlling amount of a toxin according to the invention. 

[0032] . In another aspect, the present invention provides a method of producing a toxin 
that is active against insects, comprising: (a) obtaining a transgenic host cell comprising 
a chimeric gene, which itself comprises a heterologous promoter sequence operatively 
linked to the nucleic acid molecule of the invention; and (b) expressing the nucleic acid 
molecule in the transgenic cell, which results in at least one toxin that is active against 
insects. 

[0033] In a further aspect, the present invention provides a method of producing an 
insect-resistant transgenic plant, comprising introducing a nucleic acid molecule of the 
invention into the transgenic plant, wherein the nucleic acid molecule is expressible in the 
transgenic plant in an effective amount to control insects. According to one embodiment, 
the insects are lepidopteran insects, preferably selected from the group consisting of: 
Ostrinia nubilalis (European com borer), Plutella xylostella (diamondback moth), 
Spodopterafrugiperda (fall armyworm), Agrotis ipsilon (black cutworm), Helicoverpa 
zea (com earworm), Heliothis virescem (tobacco budworm), Spodoptera exigua (beet 
armyworm), Pectinophora gossypiella (pink boll worm), Trichoplvsia ni (cabbage 
looper), Cochyles hospes (banded sunflower moth), and Homoeosoma electellum 
(sunflower head moth). 

[0034] In still a further aspect, the present invention provides a method of controlling 
insects comprising delivering to the insects an effective amount of a toxin of the present 
invention. According to one embodiment, the insects are lepidopteran insects, preferably 
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selected from the group consisting of: Ostrinia mbilalis (European com borer), Plutella 
xylostella (diamondback moth), Spodoptera fnigiperda (fall armyworm), Agrotis ipsilon 
(black cutworm), Helicoverpa zea (com earworm), Heliothis virescens (tobacco 
budworm), Spodoptera exigua (beet armyworm), Pectinophora gossypiella (pink boll 
worm), Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). Preferably, the toxin is delivered to the 
insects orally. In one preferred embodiment, the toxin is delivered orally through a 
transgenic plant comprising a nucleic acid sequence that expresses a toxin of the present 
invention. 

[0035] The present invention also provides hybrid toxins active against insects, wherein 
the hybrid toxins are encoded by a nucleic acid molecule comprising a nucleotide 
sequence according to the invention. 

[0036] In one embodiment, the hybrid toxins of the invention are active against 

lepidopteran insects, preferably against Ostrinia nubilalis (European com borer), Plutella 
xylostella (diamondback moth), Spodoptera frugiperda (fall armyworm), Agrotis ipsilon 
(black cutworm), Helicoverpa zea (com earworm), Heliothis virescens (tobacco 
budworm), Spodoptera exigua (beet armyworm), Pectinophora gossypiella (pink boll 
worm), Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). 

[0037] In another embodiment, the hybrid toxin is encoded by the approximately 2.4 kb 
DNA fragment comprised in the E. coli clone pNOV3912, designated NRRL accession 
B-30551. In a preferred embodiment, the hybrid toxin is encoded by the nucleotide 
sequence set forth in SEQ ID NO: 10. 

[0038] The present invention also provides a composition comprising an insecticidally 
effective amount of a hybrid toxin according to the invention. 

[0039] In another aspect, the present invention provides a method of producing a hybrid 
toxin active against insects, comprising: (a) obtaining a transgenic host cell comprising a 
chimeric gene, which itself comprises a heterologous promoter sequence operatively 
linked to the nucleic acid molecule of the invention; and (b) expressing the nucleic acid 
molecule in the transgenic cell, which results in at least one hybrid toxm that is active 
against insects. 
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[0040] In a fiuther aspect, the present invention provides a method of producing an 
insect-resistant transgenic plant, comprising introducing a nucleic acid molecule of the 
invention into the plant, wherein the nucleic acid molecule encodes a hybrid toxin and 
wherein the hybrid toxin is expressible in the transgenic plant in an effective amount to 
control an insect. According to one embodunent, the insects are lepidopteran insects, 
preferably selected from the group consisting of Ostrinia mhilalis (European com borer), 
Plutella yylostetta (diamondback moth), Spodoptera Jrugiperda (fall armyworm), Agrotis 
ipsilon (black cutworm), Helicoverpa zea (com earworm), Heliothis virescem (tobacco 
budworm), Spodoptera exigua (beet armyworm), Pectinophora gossypiella (pink boll 
worm), Trichoplnsia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). 

[0041] In still a fiither aspect, the present invention provides a method of controlling an 
insect comprising delivering to the insects an effective amount of a hybrid toxin of the 
present invention. According to one embodiment, the insects are lepidopteran insects, 
preferably selected from the group consisting of Ostrinia mhilalis (European com borer), 
Plutella xylostella (diamondback moth), Spodoptera frugiperda (fall armyworm), Agrotis 
ipsilon (black cutworm), Helicoverpa zea (com earworm), Heliothis virescem (tobacco 
budworm), Spodoptera exigua (beet armyworm), Pectinophora gossypiella (pink boll 
worm), Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). Preferably the hybrid toxin is delivered 
to the insects orally. In one preferred embodiment, the hybrid toxin is delivered orally 
through a transgenic plant comprising a nucleic acid sequence that expresses a hybrid 
toxin of the present invention. 

[0042] The present invention also provides a hybrid toxin active against insects, 
comprising a carboxy-terminal region of a Vip3 toxin joined in the amino to carboxy 
direction to an amino-terminal region of a different Vip3 toxin, wherein the carboxy- 
terminal region comprises an amino acid sequence which has at least 75% identity, 
preferably at least 85% identity, more preferably at least 95% identity, most preferably at 
least 99% identity with amino acids 661-788 of SEQ ID NO: 2; and wherein the amino- 
terminal region has at least 75% identity, preferably at least 85% identity, more 
preferably at least 95% identity, most preferably at least 99% identity with amino acids 1- 
660 of SEQ ID NO: 5. In a preferred embodiment, the carboxy-terminal region comprises 
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amino acids 661-788 of SEQ ID NO: 2, and the amino-terminal region comprises amino 
acids 1-660 of SEQ ID NO: 5. In a most preferred embodiment, the hybrid toxin 
comprises amino acids 1-788 of SEQ ID NO: 1 1. 

[0043] The hybrid toxin, according to this aspect of the invention, is preferably active 
against lepidopteran insects, more preferably against lepidopteran insects selected from 
the group consisting ofOstrinia mbilalis (European com borer), Plutella xylostella 
(diamondback moth), Spodoptera frugiperda (fall armyworm), Agrotis ipsilon (black 
cutworm), Helicoverpa zea (com earworm), Heliothis virescens (tobacco budworm), 
Spodoptera exigiia (beet armyworm), Pectinophora gossypiella (pink boll worm), 
Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). 

[0044] Also encompassed by this aspect of the invention is a nucleic acid molecule 
comprising a nucleotide sequence that encodes the hybrid toxin of this aspect. 

[0045] The invention further provides a method of controlling msects wherein a 
transgenic plant comprising a hybrid toxin of the invention further comprises a second 
nucleic acid sequence or groups of nucleic acid sequences that encode a second pesticidal 
principle. Particularly preferred second nucleic acid sequences are those that encode a 5- 
endotoxin, those that encode another Vegetative Insecticidal Protein toxin or those that 
encode a pathway for the production of a non-proteinaceous pesticidal principle. 

[0046] Yet another aspect of the present invention is the provision of a method for 
• mutagenizing a nucleic acid molecule according to the present invention, wherein the 
nucleic acid molecule has been cleaved into populations of double-stranded random 
fragments of a desired size, comprising: (a) adding to the population of double-stranded 
random fragments one or more single- or double-stranded oligonucleotides, wherein the . 
oligonucleotides each comprise an area of identity and an area of heterology to a double- 
stranded template polynucleotide; (b) denaturing the resultant mixture of double-stranded 
random fragments and oligonucleotides into single-stranded fragments; (c) incubating the 
resultant population of single-stranded fragments with polymerase under conditions 
which result in the annealmg of the single-stranded fragments at the areas of identity to 
form pairs of annealed fragments, the areas of identity being sufficient for one member of 
the pair to prime replication of the other, thereby forming a mutagenized double-stranded 
polynucleotide; and (d) repeating the second and third steps for at least two fiirther cycles, 
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wherein the resultant mixture in the second step of a fiirther cycle includes the 
mutagenized double-stranded polynucleotide from the third step of the previous cycle, 
and wherein the fiirther cycle forms a fiirther mutagenized double-stranded 
polynucleotide. 

[0047] Other aspects and advantages of the present invention will become apparent to 
those skilled in the art from a study of the following description of the invention and non- 
limiting examples. 

BRIEF DESCJOPTION OF THE SEQUENCES IN THE SEQUENCE LISTING 

SEQ ID NO: 1 is a native vipiC nucleotide sequence. 

SEQ E) NO: 2 is the amino acid sequence encoded by SEQ ID NO: 1. 

SEQ ID NO: 3 is a maize optimized v?p3C nucleotide sequence. 

SEQ ID NO: 4 is a native vjfpB A(a) nucleotide sequence. 

SEQ ID NO: S is the amino acid sequence encoded by SEQ ID NO: S. 

SEQ ID NO: 6 is a native V7]p3B nucleotide sequence. 

SEQ ID NO: 7 is the amino acid sequence encoded by SEQ ID NO: 7. 

SEQ ID NO: 8 is a native yip3Z nucleotide sequence. 

SEQ ID NO: 9 is the amino acid sequence encoded by SEQ ID NO: 9. 

SEQ ID NO: 10 is a hybrid vipiArC nucleotide sequence. 

SEQ ID NO: 1 1 is the amino acid sequence encoded by SEQ ID NO: 1 1. 

SEQ ID NO: 12-29 are primer sequences useful in practicing the invention. 

SEQ ID NO: 30 is the nucleotide sequence of the vector pNOV2149. 

SEQ ID NO: 31 is the v/p3C-12168 nucleotide sequence. 

SEQ ID NO: 32 is the amino acid sequence encoded by SEQ ID NO: 32. 

SEQ ID NO: 33 is the maize optimized vi/73C-12168 nucleotide sequence. 



The following material was deposited with the Agricultural Research Service, Patent 
Culture Collection (NRRL), 1815 North University Street, Peoria, Illinois 61604, under the 
terms of the Budapest Treaty on the International Recognition of the Deposit of 
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Microorgamsms for the Purposes of Patent Procedure. All restrictions on the availability of 
the deposited material will be irrevocably removed upon granting of the patent. 



Isolate/Clone 
BX strain C1674 
B.t strain C536 
£co//BL21(pNOV3906) 
£c:o/zBL21(pNOV3905) 
co//DH5a(pNOV3910) 
£.co//DH5a(pNOV3911) 
E. coliBHSa (pNOV3912) 



Accession Number 
NRRLB-30556 
NRRLB-30557 
NRRLB.30555 
NRRLB-30554 
NRRLB-30553 
NRRLB-30552 
NRRLB-30551 



Date of Deposit 
February 7, 2002 
February 7, 2002 
February 7, 2002 
February 7, 2002 
February 7, 2002 
February 7, 2002 
February 7, 2002 



DEFINrnONS 

[0048] "Activity" of the toxins of the invention is meant that the toxins function as orally 
active insect control agents, have a toxic effect, or are able to disrupt or deter insect 
feeding, which may or may not cause death of the insect. When a toxin of the invention is 
delivered to the insect, the result is typically death of the insect, or the insect does not 
feed upon the source that makes the toxin available to the insect. 

[0049] "Associated with / operatively linked" refer to two nucleic acid sequences that are 
related physically or functionally. For example, a promoter or regulatory DNA sequence 
is said to be "associated with" a DNA sequence that codes for an RNA or a protein if the 
two sequences are operatively linked, or situated such that the regulator DNA sequence 
will affect the expression level of the coduig or structural DNA sequence. 

[0050] A "chimeric gene" is a recombinant nucleic acid sequence m which a promoter or 
r^latory nucleic acid sequence is operatively linked to, or associated with, a nucleic 
acid sequence that codes for an mRNA or which is e?q)ressed as a protein, such that the 
regulator nucleic acid sequence is able to regulate transcription or expression of the 
associated nucleic acid sequence. The regulator nucleic acid sequence of the chimeric 
gene is not normally operatively linked to the associated nucleic acid sequence as found 
in nature. 
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[0051] A '•coding sequence" is a nucleic acid sequence that is transcribed into RNA such 
as mRNA, rRNA, tRNA, snRNA, sense KNA or antisense RNA. Preferably the RNA is 
then translated in an organism to produce a protein. 

[0052] To "control" insects means to inhibit, through a toxic efifect, the ability of insect 
pests to survive, grow, feed, and/or reproduce, or to limit insect-related damage or loss in 
crop plants. To "control" insects may or may not mean killing the insects, although it 
preferably means killing the insects, 

[0053] Corresponding to: in the context of the present mvention, "corresponding to" or 
"corresponds to" means that when the nucleic acid coding sequences or amino acid 
sequences of different Vip3 genes or proteins are aligned with each other, the nucleic or 
amino acids that "correspond to" certain enumerated positions are those that align with 
these positions but that are not necessarily in these exact numerical positions relative to 
the particular Vip3's respective nucleic acid coding sequence or amino acid sequence. 
Likewise, when the coding or amino acid sequence of a particular Vip3 (for example, 
VipSZ) is aligned with the coding or amino acid sequence of a reference Vip3 (for 
example, Vip3C), the nucleic acids or amino acids in the Vip3Z sequence that 
"correspond to" certain enumerated positions of the Vip3C sequence are those that align 
with these positions of the Vip3C sequence, but are not necessarily in these exact 
numerical positions of the Vip3Z protein's respective nucleic acid coding sequence or 
amino acid sequence. 

[0054] To "deliver" a toxin means that the toxin comes in contact with an insect, resulting 
in toxic effect and control of the insect. The toxin can be delivered in many recognized 
ways, e.g., orally by ingestion by the insect or by contact with the insect via transgenic 
plant expression, formulated protein composition(s), sprayable protein composition(s), a 
bait matrix, or any other art-recognized toxin delivery system. 

[0055] "Effective insect-controlling amount" means that concentration of toxin that 
inhibits, through a toxic effect, the ability of insects to survive, grow, feed and/or 
reproduce, or to limit insect-related damage or loss in crop plants. "Eflfective insect- 
controlling amount" may or may not mean killing the insects, although it preferably 
means killing the msects. 

[0056] "Expression cassette" as used herein means a nucleic acid sequence capable of 
directing expression of a particular nucleotide sequence m an appropriate host cell. 
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comprising a promoter operably linked to the nucleotide sequence of interest which is 
operably linked to termination signals. It also typically comprises sequences required for 
proper translation of the nucleotide sequence. The ^ression cassette comprising the 
nucleotide sequence of interest may be chimeric, meaning that at least one of its 
components is heterologous with respect to at least one of its other components. The 
expression cassette may also be one that is naturally occurring but has been obtained in a 
recombinant form useful for heterologous expression. Typically, however, the expression 
cassette is heterologous with respect to the host, i.e., the particular nucleic acid sequence 
of the expression cassette does not occur naturally in the host cell and must have been 
introduced into the host cell or an ancestor of the host cell by a transformation event. The 
expression of the nucleotide sequence in the expression cassette may be under the control 
of a constitutive promoter or of an inducible promoter that initiates transcription only 
when the host cell is exposed to some particular external stimulus. In the case of a 
multicellular organism, such as a plant, the promoter can also be specific to a particular 
tissue, or organ, or stage of development. 

[0057] A "gene" is a defined region that is located within a genome and that, besides the 
aforementioned coding nucleic acid sequence, comprises other, primarily regulatory, 
nucleic acid sequences responsible for the control of the expression, that is to say the 
transcription and translation, of the coding portion. A gene may also comprise other 5' 
and 3* untranslated sequences and termination sequences. Further elements that may be 
present are, for example, introns. 

[0058] "Gene of interest" refers to any gene which, when transferred to a plant, confers 
upqn the plant a desired characteristic such as antibiotic resistance, virus resistance, insect 
resistance, disease resistance, or resistance to other pests, herbicide tolerance, improved 
nutritional value, improved performance in an industrial process or altered reproductive 
capability. The "gene of interest" may also be one that is transferred to plants for the 
production of commercially valuable enzymes or metabolites in the plant. 

[0059] A "heterologous" nucleic acid sequence is a nucleic acid sequence not naturally 
associated with a host cell into which it is introduced, including non- naturally occurring 
muhiple copies of a naturally occurring nucleic acid sequence. 

[0060] A "homologous" nucleic acid sequence is a nucleic acid sequence naturally 
associated with a host cell into which it is introduced. 
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[0061] "Homologous recombination" is the reciprocal exchange of nucleic acid fragments 
between homologous nucleic acid molecules. 

[0062] **Hybrid toxin" as used herein is an insecticidal toxin made by the hand of man 
which comprises amino acid regions or fragments of one toxin joined with amino acid 
regions or fragments from a different toxin. For example, without limitation, joining the 
C-terminal region of Vip3C, from amino acids 661-788 of SEQ ID NO: 2, with the N- 
terminal region of Vip3 A, from amino acid 1-660 of SEQ ID NO: 4, creates a hybrid 
toxin with an amino acid sequence set forth in SEQ ID NO: 11. 

[0063] "Insecticidal" is defined as a toxic biological activity capable of controlling 
insects, preferably by killing them 

[0064] A nucleic acid sequence is "isocoding with" a reference nucleic acid sequence 
when the nucleic acid sequence encodes a polypeptide having the same amino acid 
sequence as the polypeptide encoded by the reference nucleic acid sequence. 

[0065] An "isolated" nucleic acid molecule or an isolated protein or toxin is a nucleic 
acid molecule or protein or toxin that, by the hand of mflti^ exists apart from its native 
environment and is therefore not a product of nature. An isolated nucleic acid molecule or 
protein or toxin may exist in a purified form or may exist in a non-native environment 
such as, for example, a recombinant host cell or a transgenic plant. 

[0066] Native: refers to a gene that is present in the genome of an untransformed cell. 

[0067] Naturally occurring: the term "naturally occurring" is used to describe an object 
that can be found in nature as distinct from being artificially produced by man. For 
example, a protein or nucleotide sequence present in an organism (including a virus), 
which can be isolated from a source in nature and which has not been intentionally 
modified by man in the laboratory, is naturally occurring. 

[0068] A "nucleic acid molecule" or "nucleic acid sequence" is a linear segment of 
single- or double-stranded DNA or RNA that can be isolated from any source. In the 
context of the present invention, the nucleic acid molecule is preferably a segment of 
DNA. 

[0069] A "plant" is any plant at any stage of development, particularly a seed plant. 
[0070] A "plant cell" is a structural and physiological unit of a plant, comprising a 
protoplast and a cell wall. The plant cell may be in form of an isolated single cell or a 
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cultured cell, or as a part of higher organized unit such as, for example, plant tissue, a 

plant organ, or a whole plant. 
[0071] "Plant cell culture" means cultures of plant units such as^ for example, protoplasts, 

cell culture cells, cells in plant tissues, pollen, pollen tubes, ovules, embryo sacs, zygotes 

and embryos at various stages of development. 
[0072] "Plant material" refers to leaves, stems, roots, flowers or flower parts, fruits, 

pollen, egg cells, zygotes, seeds, cuttings, cell or tissue cultures, or any other part or 

product of a plant. 

[0073] A "plant organ" is a distinct and visibly structured and differentiated part of a 

plant such as a root, stem, leaf, flower bud, or embryo. 

[0074] "Plant tissue" as used herein means a group of plant cells organized into a 

structural and functional unit. Any tissue of a plant in planta or in culture is included. This 
term includes, but is not limited to, whole plants, plant organs, plant seeds, tissue culture 
and any groups of plant cells organized into structural and/or functional units. The use of 
this term in conjunction with, or in the absence of, any specific type of plant tissue as 
listed above or otherwise embraced by this definition is not intended to be exclusive of 
any other type of plant tissue. 

[0075] A "promoter" is an untranslated DNA sequence upstream of the coding region that 
contains the binding site for RNA polymerase 1 1 and initiates transcription of the DNA. 
The promoter region may also include other elements that act as regulators of gene 
expression. 

[0076] A "protoplast" is an isolated plant cell without a cell wall or with only parts of the 
cell wall, 

[0077] "Regulatory elements" refer to sequences involved in controlling the expression of 
a nucleotide sequence. Regulatory elements comprise a promoter operably linked to the 
nucleotide sequence of interest and termination signals. They also typically encompass 
sequences required for proper translation of the nucleotide sequence. 

[0078] A "shuffled" nucleic acid is a nucleic acid produced by a shuffling procedure such 
as any shuffling procedure set forth herein. Shuffled nucleic acids are produced by 
recombining (physically or virtually) two or more nucleic acids (or character strings), 
e.g., in an artificial, and optionally recursive, fashion. Generally, one or more screening 
steps are used in shuffling processes to identify nucleic acids of interest; this screening 
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Step can be performed before or after any recombination step. In some (but not all) 
shuffling embodiments, it is desirable to perform multiple rounds of recombination prior 
to selection to increase the diversity of the pool to be screened. The overall process of 
recombination and selection are optionally repeated recursively. Depending on context, 
shuffling can refer to an overall process of recombination and selection, or, alternately, 
can simply refer to the recombmational portions of the overall process. 

[0079] Substantially identical: the phrase "substantially identical," in the context of two 
nucleic acid or protein sequences, refers to two or more sequences or subsequences that 
have at least 60%, preferably 80%, more preferably 90, even more preferably 95%, and 
most preferably at least 99% nucleotide or amino acid residue identity, when compared 
and aligned for maximum correspondence, as measured using one of the following 
sequence comparison algorithms or by visual inspection. Preferably, the substantial 
identity exists over a region of the sequences that is at least about 50 residues in length, 
more preferably over a region of at least about 100 residues, and most preferably the 
sequences are substantially identical over at least about 150 residues. In an especially 
preferred embodiment, the sequences are substantially identical over the entire length of 
the codmg regions. Furthermore, substantially identical nucleic acid or protein sequences 
perform substantially the same function, 

[0080] For sequence comparison, typically one sequence acts as a reference sequence to 
which test sequences are compared. When usmg a sequence comparison algorithm, test 
and reference sequences are input into a computer, subsequence coordinates are 
designated if necessary, and sequence algorithm program parameters are designated. The 
sequence comparison algorithm then calculates the percent sequence identity for the test 
sequence(s) relative to the reference sequence, based on the designated program 
parameters. 

[0081] Optimal alignment of sequences for comparison can be conducted, e.g., by the 
local homology algorithm of Smith & Waterman, Adv. Appl Math 2: 482 (1981), by the 
homology alignment algorithm of Needleman & Wunsch, 1 Mol Biol 48: 443 (1970), by 
the search for similarity method of Pearson & Lipman, Proc, Natl Acad Set USA 85: 
2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 
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Group, 575 Sdence Dr., Madison, WI), or by visual inspection {see generally, Ausubel et 
al, ifrfra). 

[0082] One example of an algorithm that is suitable for determining percent sequence 
identity and sequence similarity is the BLAST algorithm, which is described in Altschul 
et al, J. MoL BioL 215: 403-410 (1990). Software for performing BLAST analyses is 
publicly available through the National Center for Biotechnology Information 
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 
sequence pairs (HSPs) by identifying short words of length W in the query sequence, 
which either match or satisfy some positive-valued threshold score T when aligned with a 
word of the same length in a database sequence. T is referred to as the neighborhood 
word score threshold (Altschul et al, 1990). These initial neighborhood word hits act as 
seeds for initiating searches to fmd longer HSPs containing them. The word hits are then 
extended in both directions along each sequence for as far as the cumulative alignment 
score can be increased. Cumulative scores are calculated using, for nucleotide sequences, 
the parameters M (reward score for a pair of matching residues; always > 0) and N 
(penalty score for mismatching residues; always < 0). For amino acid sequences, a 
scoring matrix is used to calculate the cumulative score. Extension of the word hits in 
each direction are halted when the cumulative alignment score falls off by the quantity X 
from its maximum achieved value, the cumulative score goes to zero or below due to the 
accumulation of one or more negative-scoring residue alignments, or the end of either 
sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 
uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, a cutoff of 100, M=5, 
N=-4, and a comparison of both strands. For amino acid sequences, the BLASTP program 
uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 
scoring matrix (see HenikofF& Henikof^ Proc. NatL Acad Set USA 89: 10915 (1989)). 

[0083] In addition to calculating percent sequence identity, the BLAST algorithm also 
performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & 
Altschul, Proc. Natl Acad, Sci. USA 90: 5873-5787 (1993)). One measure of similarity 
provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides 
an indication of the probability by which a match between two nucleotide or amino acid 
sequences would occur by chance. For example, a test nucleic acid sequence is 
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considered similar to a reference sequence if the smallest sum probability m a comparison 
of the test nucleic acid sequence to the reference nucleic acid sequence is less than about 
0.1, more preferably less than about 0.01, and most preferably less than about 0.001. 
[0084] Another indication that two nucleic acid sequences are substantially identical is 
that the two molecules hybridize to each other under stringent conditions. The phrase 
"hybridizing specifically to" refers to the binding, duplexing, or hybridizing of a molecule 
only to a particular nucleotide sequence under stringent conditions when that sequence is 
present in a complex mixture (e.g., total cellular) DNA or RNA. "Bind(s) substantially" 
refers to complementary hybridization between a probe nucleic acid and a target nucleic 
acid and embraces minor mismatches that can be accommodated by reducing the 
stringency of the hybridization media to achieve the desired detection of the target nucleic 
acid sequence. 

[0085] "Stringent hybridization conditions" and "stringent hybridization wash conditions" 
in the context of nucleic acid hybridization experiments such as Southern and Northern 
hybridizations are sequence dependent, and are different under different environmental . 
parameters. Longer sequences hybridize specifically at higher temperatures. An extensive 
guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory 
Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid 
Probes part I chapter 2 "Overview of principles of hybridization and the strategy of 
nucleic acid probe assays" Elsevier, New York. Generally, highly stringent hybridization 
and wash conditions are selected to be about 5°C lower than the thermal melting point 
(Tm) for the specific sequence at a defined ionic strength and pH. Typically, under 
"stringent conditions" a probe will hybridize to its target subsequence, but to no other 
sequences. 

[0086] The Tm is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. Very stringent conditions are 
selected to be equal to the Tm for a particular probe. An example of stringent 
hybridization conditions for hybridization of complementary nucleic acids which have 
more than 100 complementary residues on a filter in a Southern or northern blot is 50% 
formamide with 1 mg of heparin at 42®C, with the hybridization being carried out 
overnight. An example of highly stringent wash conditions is 0.1 5M NaCl at 72°C for 
about 15 minutes. An example of stringent wash conditions is a 0.2x SSC wash at 65**C 



20 



wo 03/075655 



PCTAJS03/04735 



for 15 minutes {see, Sambrook, brfra, for a description of SSC buffer). Often, a high 
stringency wash is preceded by a low stringency wash to remove background probe 
signal. An example medium stringency wash for a duplex o^ e.g., more than 100 
nucleotides, is Ix SSC at 45°C for 15 minutes. An example low stringency wash for a 
duplex 0^ e.g., more than 100 nucleotides, is 4-6x SSC at 40^C for 15 minutes. For short 
probes {e.g., about 10 to 50 nucleotides), stringent conditions typically involve salt 
concentrations of less than about 1.0 M Na ion, typically about 0.01 to 1.0 M Na ion 
concentration (or other salts) at pH 7.0 to 8.3, and the temperature is typically at least 
about 30°C. Stringent conditions can also be achieved with the addition of destabilizing 
agents such as formamide. In general, a signal to noise ratio of 2x (or higher) than that 
observed for an unrelated probe in the particular hybridization assay indicates detection of 
a specific hybridization. Nucleic acids that do not hybridize to each other under stringent 
conditions are still substantially identical if the proteins that they encode are substantially 
identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum 
codon degeneracy permitted by the genetic code. 
[0087] The following are examples of sets of hybridization/wash conditions tiiat may be 
used to done homologous nucleotide sequences that are substantially identical to 
reference nucleotide sequences of the present invention: a reference nucleotide sequence 
preferably hybridizes to the reference nucleotide sequence in 7% sodium dodecyl sulfate 
(SDS), 0.5 M NaP04, 1 mM EDTA at 50T witii washing in 2X SSC, 0. 1% SDS at 50T, 
more desirably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 50T 
with washing in IX SSC, 0. 1% SDS at 50T, more desirably still in 7% sodium dodecyl 
sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 50T with washing in 0.5X SSC, 0. 1% SDS 
at 50^C, preferably in 7% sodium dodecyl sulfate (SDS), 0.5 MNaP04, 1 mM EDTA at 
50*^C with washing in O.IX SSC, 0.1% SDS at 50*^C, more preferably in 7% sodium 
dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 50°C witii washing m 0. IX SSC, 
0.1%SDSat65X. 

[0088] A further indication that two nucleic acid sequences or proteins are substantially 
identical is that the protein encoded by the first nucleic acid is immunologically cross 
reactive with, or specifically binds to, the protein encoded by the second nucleic acid. 
Thus, a protein is typically substantially identical to a second protein, for exanq)le, where 
the two proteins differ only by conservative substitutions. 
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[0089] "Synthetic** refers to a nucleotide sequence comprising structural characters that 
are not present in the natural sequence. For example, an artificial sequence that resembles 
more closely the G+C content and the normal codon distribution of dicot and/or monocot 
genes is said to be synthetic. 

[0090] "Transformation'* is a process for introdudng heterologous nucleic acid into a host 
cell or organism. In particular, "transformation" means the stable integration of a DNA 
molecule into the genome of an organism of mterest. 

[0091] "Transformed / transgenic / recombinant" refer to a host organism such as a 
bacterium or a plant into which a heterologous nucleic acid molecule has been introduced. 
The nucleic acid molecule can be stably integrated into the genome of the host or the 
nucleic acid molecule can also be present as an extrachromosomal molecule. Such an 
extrachromosomal molecule can be auto-replicating. Transformed cells, tissues, or plants 
are understood to encompass not only the end product of a transformation process, but 
also transgenic progeny thereof A "non-transformed", "non-transgenic", or "non- 
recombinant" host refers to a wild-type organism, e.g., a bacterium or plant, which does 
not contain the heterologous nucleic acid molecule. 

[0092] The "Vip3 class of proteins" comprises Vip3 A(a), Vip3A(b), Vip3A(c), Vip3B, 
Vip3C(a), Vip3C(b), Vip3Z, and their homologues. "Homologue" is used throughout to 
mean that the indicated protein or polypeptide bears a defined relationship to other 
members of the Vip3 class of proteins. This defined relationship includes but is not 
limited to, 1) proteins which are at least 70%, more preferably at least 80% and most 
preferably at least 90% identical at the sequence level to another member of the Vip3 
class of proteins while also retaining pesticidal activity, 2) proteins which are cross- 
reactive to antibodies which immunologically recognize another member of the Vip3 
class of proteins, 3) proteins which are cross-reactive with a receptor to another member 
of the Vip3 class of proteins and retain the ability to induce programmed cell death, and 
4) proteins which are at least 70%, more preferably at least 80% and most preferably at 
least 90% identical at the sequence level to the toxic core region of another member of the 
Vip3 class of proteins while also retaining pesticidal activity. Other Vip3 homologues 
have been disclosed in WO 98/18932, WO 98/33991, WO 98/00546, and WO 99/57282. 

[0093] Nucleotides are indicated by thdr bases by the following standard abbreviations: 
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adenine (A), cytosine (C), thymine (T), and guanine (G). Amino acids are likewise 
indicated by the following standard abbreviations: alanine (Ala; A), arginine (Arg; R), 
asparagine (Asn; N), aspartic acid (Asp; D), cysteine (Cys; C), glutamine (Gin; Q), 
glutamic acid (Glu; E), glycine (Gly; G), histidine (His; H), isoleucine (lie; 1), leucine 
(Leu; L), lysine (Lys; K), methionine (Met; M), phenylalanine (Phe; F), proline (Pro; P), 
serine (Ser; S), threonine (Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y), and valine 
(Val;V). 



[0094] This invention relates to nucleic acid sequences whose ^ression results in novel 
toxins, and to the making and using of the toxins to control insect pests. The nucleic acid 
sequences are derived &om Bacillus, a gram-positive spore-forming microorganism. In 
particular, novel Vip3 proteins, use&l as pesticidal agents, are provided. 

[0095] For purposes of the present invention, insect pests include insects selected from, 
for example, the orders Coleoptera, Diptera, Hymenoptera, Lepidoptera, MaUophaga, 
Homoptera, Hemiptera, Orthroptera, Thysanoptera, Dermaptera, Isoptera, Anoplura, 
Siphonaptera, and Trichoptera, particularly Lepidoptera. 

[0096] Tables 1-7 give a list of pests associated with major crop plants. Such pests are 
included within the scope of the present invention. 



DETAILED DESCRIPTION OF THE INVENTION 



Table 1 



Lepidoptera 



Osthniq nubilalis^ European com borer 
Agrotis ipsilon^ black cutworm 
Heliaoverpa ze^, com eaiworm 
Spodoptera frugfperda^ M armyworm 
Diatraea grandiosella^ southwestern com 



Elasmopalpus lignoselluSy lesser cornstalk 



borer 



borer 



Spodoptera exigua^ beet ann3nYonn 
Pectinophora gossyplella, pink boUworm 
Scirpophaga imotata, white stemborer 
Cnaphalocrocis medinalis, leaffolder 
ChUoplejadeUus,nQestdJkbom . 
Nymphula depunctalis, caseworm 
Spodoptera litura, cutwomi 
Spodoptera mauritia, rice swarming caterpillar 



Diatraea saccharalis, sugarcane borer 
Heliohtis virescerts, cotton bollworm 
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Scirpophaga incertulas, yellow stemborer 
CMo polychrysa^ darkheaded riceborer 
Mythimna separata^ oriental aimywonn 
CMo partelltis, soighum borer 
Feltia subterranea, granulate cutworm 
Homoeosoma electellum^ sunflower head moth 



Cochylis hospes, baaded sunflower moth 
Pseudaletia unipmctata, army worm 
Agrotis orthogoma, pale westem cutworm 
Pseudoplusia includens, soybean looper 
Anticarsia gemmatalis, velvetbean caterpillar 
Plathypem scabra^ green cloverworm 



Table 2 



Coleoptera 



Diabrotica virgifera, westem com rootwonn 
Diabrotica longicomis, northern com 
rootworm 

Diabrotica undecimpunctata, southem com 
rootworm 

Cyclocephala borealis^ northem masked 
chafer (white grub) 

Cyclocephala immaculata^ southem masked 
chafer (white gmb) 
Popillia japonica, Japanese beetle 
Chaetocnema pulicaria, com flea beetle 
Sphenophorus mcddis, maize billbug 



Phyllophaga crinita, white gmb 
Melanotus spp., Eleodes, Conoderus, and 
Aeolus spp,, wireworms 
Oulema melanopus^ cereal leaf beetle 
Chaetocnema pulicaria, com flea beetle 
Oulema melanopus, cereal leaf beetle 
Hypera ptoTCtata, clover leaf weevil 
Anthonomus grandis^ boll weevil 
Colaspis brumea, grape oolaspis 
Lissorhoptrus oryzophilus^ rice water weevil 
Sitophilus oryzae^ rice weevil 
Epilachna varivestis, Mexican bean beetle 



Table 3 
Homoptera 



Rhopalosiphum maidis, com leaf ^hid 
Amraphis maidiradicis, com root aphid 
Sipha Jlava, yellow sugarcane aphid 
Schizaphis gramimm, greenbug 
Macrosiphum avenae, English grain aphid 
Aphis gossypii, cotton aphid 



Pseudatomoscelis seriatuSy cotton fleahopper 
Trialeurodes abutilonea, bandedwinged 
whitefly 

Nephotettix nigropictus, rice leafhopper 
Myziis persicae, green peach aphid 
Empoasca fabae, potato leafhopper 
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Table 4 



Hemiptera 



Blissus leucopterus leucopterus, chinch bug Acrostemum hilare, green stink bug 

Lygus Ifneolahs, tarnished plant bug Euschistus servusMoyfn stink bug 



Tables 

Orthroptera 

Melanoplus femurrubrum, redlegged grasshopper 

Mebmoplus sanguinipes, migratory grasshopper 

Melanoplus differentialis, differential grasshopper 



Table 6 



Diptera 



Hylemya platura^ seedcom maggot 
Agromyza parvicomis, com blotch leafininer 
Contarirda sorghicola, sorghum midge 
Mayetiola destructor^ Hessian fly 
Sitodiplosis mosellana, vbeat midge 



Meromyza americana^ ^heat stem maggot 
Hylemya coarctata^ wheat bulb fly 
Neolasioptera murtfeldticma^ sunflower seed 
midge 



Table? 

Thysanoptera 

Anaphothrips obscurus, grass thrips 

FrarMiniella fusca, tobacco thrips 

Thrips tabaci, onion thrips 

Sericothrips variabilis, soybean thrips 



[0097] The expression of the nucleic acid sequences of the present invention results in 
toxins that can be used to control lepidopteran insects, for example, without limitation, 
Ostrinia nubilalis (European com borer), Phitellaxylostella (diamondback moth), 
Spodoptera fnigiperda (fall armyworm), Agrotis ipsilon (black cutworm), Helicoverpa 
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zea (corn eanvoimX Heliothis virescem (tobacco budwonn), Spodoptera adgua (beet 
armyworm), Pectinophora gossypielh (pink boll worm), Trichoplusla ni (cabbage 
looper), Cochyles hospes (banded sunflower moth), and Homoeosoma electelltm 
(sunflower head moth). 
[0098] In one preferred embodiment, the invention encompasses an isolated nucleic 
acid molecule comprising a nucleotide sequence that: (a) has a compliment that 
hybridizes to nucleotides 1981-2367 of SEQ ID NO: 1 in 7% sodium dodecyl sulfate 
(SDS), 0.5 M NaP04, 1 mM EDTA at 50^ C. with washing in 0. IXSSC, 0. 1% SDS at 
dS'^C; or (b) is isocoding with the nucleotide sequence of (a); or (c) comprises a 
consecutive 20 base pah* nucleotide portion identical in sequence to a consecutive 20 
base pair nucleotide portion of a nucleotide sequence of (a) or (b); or (d) has at least 
93% sequence identity with SEQ ID NO: 1; or (e) encodes an amino acid sequence 
having at least 91% sequence identity with SEQ ID NO: 2, wherein expression of the 
isolated nucleic acid molecule resuhs in insect control activity. When expressed in a 
heterologous host, the nucleic acid molecule of SEQ ID NO: 1, SEQ ID NO: 3, SEQ 
ID NO: 10, and SEQ ID NO: 31 results in insect control activity against Ostrinia 
mbilalis (European com borer), Plutella xylostella (diamondback moth), Spodoptera 
frugiperda (fall armyworm), Agrotis ipsilon (black cutworm), Helicoverpa zea (com 
earworm), Heliothis virescem (tobacco budworm), Spodoptera exigua (beet 
armyworm), Pectinophora gossypiella (pink boll worm), Trichoplusia ni (cabbage 
looper), Cochyles hospes (banded sunflower moth), Homoeosoma electelhmt 
(sunflower head moth), showing that the nucleotide sequence set forth in SEQ ID NO: 
1, SEQ ID NO: 3, SEQ ID NO: 10, and SEQ ID NO: 3 1 is sufiBcient for such insect 
control activity. 

[0099] In one embodiment, the invention encompasses a nucleic acid molecule 
comprising a nucleotide sequence that has at least 75% sequence identity with 
nucleotides 1981-2367 of SEQ ID NO: 1. Preferably, the isolated nucleic acid 
molecule comprises a nucleotide sequence that has at least 85% sequence identity with 
nucleotides 1981-2367 of SEQ ID NO: 1. More preferably, the isolated nucleic acid 
molecule comprises a nucleotide sequence that has at least 95% sequence identity with 
nucleotides 1981-2367 of SEQ ID NO: 1. Even more preferably, the isolated nucleic 
acid molecule comprises a nucleotide sequence that has at least 99% sequence identity 
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with nucleotides 1981-2367 of SEQ ID NO: 1. Most preferably, the isolated nucleic 
acid molecule comprises nucleotides 1981-2367 of SEQ ID NO: 1 or SEQ E) NO: 3. 
[00100] In another embodiment, the invention encompasses a nucleic acid molecule 
comprising a nucleotide sequence that has at least 93% sequence identity with SEQ ID 
NO: L Preferably, the isolated nucleic acid molecule comprises a nucleotide sequence 
that has at least 95% sequence identity with SEQ ID NO: 1. More preferably, the 
isolated nucleic acid molecule comprises a nucleotide sequence that has at least 99% 
sequence identity with SEQ ID NO: 1. Most preferably, the isolated nucleic acid 
molecule comprises nucleotides 1-2367 of a nucleotide sequence selected from the 
group consisting ofSEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 10, SEQ ID NO: 31, 
and SEQ ID NO: 33. 

[00101] In yet another embodiment, the invention encompasses a nucleic acid molecule 
comprised m a Bacillus thuringiefisis isolate selected from the group consisting of 
C1674, designated NRRL accession B-30556; and C536, designated NRRL accession 
B-30557. In a preferred embodiment, the invention encompasses a nucleic acid 
molecule comprised in an J?, coli clone selected from the group consisting of 
pNOV3910, designated NRRL accession B-30553; pNOV3911, designated NRRL 
accession B-30552; pNOV3906, designated NRRL accession B-3 0555; pNOV3905, 
designated NRRL accession B-30554; and pNOV3912, designated NRRL accession 
B-30551, whose expression results in an insecticidal toxin. 

[00102] The present invention also encompasses an isolated nucleic acid molecule 
which encodes a toxin comprising an amino acid sequence with at least 75% identity 
with amino acids 661-788 of the amino acid sequence of SEQ ID NO: 2, Preferably, 
the isolated nucleic acid molecule encodes a toxin comprising an amino acid sequence 
which has at least 85% identity with amino acids 661-788 of the amino acid sequence 
of SEQ ID NO: 2. More preferably, the isolated nucleic acid molecule encodes a toxin 
comprising an amino acid sequence which has at least 95% identity with amino acids 
661-788 of the amino acid sequence of SEQ ID NO: 2. Even more preferably, the 
isolated nucleic acid molecule encodes a toxin comprising an amino acid sequence 
which has at least 99% identity with amino acids 661-788 of the amino acid sequence 
of SEQ ID NO: 2. Most preferably, the isolated nucleic acid molecule encodes a toxin 
comprising amino acids 661-788 of the amino acid sequence of SEQ ID NO: 2. 
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[00103] In another embodiment, the isolated nucleic acid molecule encodes a toxin 
comprising an amino acid sequence which has 91% identity to the amino acid 
sequence set forth in SEQ ID NO: 2. Preferably, the isolated nucleic acid molecule 
encodes a toxin comprising an amino acid sequence which has 95% identity to the 
ammo acid sequence set forth in SEQ ID NO: 2. More preferably, the isolated nucleic 
acid molecule encodes a toxin comprising an amino acid sequence which has 99% 
identity to the amino acid sequence set forth in SEQ ID NO: 2. Most preferably, the 
isolated nucleic acid molecule encodes a toxin comprising the amino acid sequence set 
forth in SEQ ID NO: 2, SEQ ID NO: 1 1, or SEQ ID NO: 32. 

[00104] The present invention also encompasses recombinant vectors comprising the 
nucleic acid sequences of this invention. In such vectors, the nucleic acid sequences 
are preferably comprised in expression cassettes comprising regulatory elements for 
expression of the nucleotide sequences in a transgenic host cell capable of expressing 
the nucleotide sequences. Such regulatory elements usually comprise promoter and 
termination signals and preferably also comprise elements allowing efficient 
translation of polypeptides encoded by the nucleic acid sequences of the present 
invention. Vectors comprising the nucleic acid sequences are usually capable of 
replication in particular host cells, preferably as extrachromosomal molecules, and are 
therefore used to amplify the nucleic acid sequences of this invention in the host cells. 
In one embodiment, host cells for such vectors are microorganisms, such as bacteria, 
in particular jE. colt In another embodiment, host cells for such recombinant vectors 
are endophytes or epiphytes. A preferred host cell for such vectors is a eukaryotic cell, 
such as a yeast cell, a plant cell, or an insect cell. Plant cells such as maize cells or 
cotton are most preferred host cells. In another preferred embodiment, such vectors are 
viral vectors and are used for replication of the nucleotide sequences in particular host 
cells, e.g. insect cells or plant cells. Recombinant vectors are also used for 
transformation of the nucleotide sequences of this invention into transgenic host cells, 
whereby the nucleotide sequences are stably integrated into the DNA of such 
transgenic host cells. In one, such transgenic host cells are prokaryotic cells. In a 
preferred embodiment, such transgenic host cells are eukaryotic cells, such as yeast 
cells, insect cells, or plant cells. In a most preferred embodiment, the transgenic host 
cells are plant cells, such as maize cells or cotton cells. 
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[00105] In yet another aspect, the present invention provides toxins produced by the 
^ression of the nucleic acid molecules of the present invention. 

[00106] In preferred embodiments, the insecticidal toxins of the mvention comprise a 
polypeptide encoded by a nucleotide sequence of the invention. In a fiirther preferred 
embodiment, the toxin is produced by a Bacillus thuringiensis isolated selected from 
the group consisting of CI 674, designated NRRL accession B-30556; and C536, 
designated NRRL accession B-30557. 

[00107] In another embodiment, the toxins are produced by an E. coli clone selected 
from the group consisting of pNOV3910, designated NRRL accession B-30553; 
pNOV3911, designated NRRL accession B-30552; pNOV3906, designated NRRL 
accession B-30555; pNOV3905, designated NRRL accession B-30554; and 
pNOV3912, designated NRRL accession B-3055 1 . In a preferred embodiment, the 
toxin is produced by the expression of the nucleic acid molecule comprising 
nucleotides 1-2367 of SEQ ID NO: 1 or nucleotides 1-2367 of SEQ ID NO: 3,or 
nucleotides 1-2367 of SEQ ID NO: 10, or nucleotides 1-2367 of SEQ ID NO: 31. 

[00108] The present invention encompasses a toxin which comprises an amino acid 
sequence which has at least 75% identity with amino acids 661-788 of the amino acid 
sequence of SEQ ID NO: 2. Preferably, the toxin comprises an amino acid sequence 
which has at least 85% identity with amino acids 661-788 of the amino acid sequence 
of SEQ ID NO: 2. Even more preferably, the toxin comprises an amino acid sequence 
which has at least 95% identity with amino acids 661-788 of the amino acid sequence 
of SEQ ID NO: 2. Even more preferably, the toxin comprises an amino acid sequence 
which has at least 99% identity with amino acids 661-788 of the amino acid sequence 
of SEQ ID NO: 2. Most preferably, the toxin comprises amino acids 661-788 of SEQ 
ID NO: 2. 

[00109] In another preferred embodiment, a toxin of the present invention comprises an 
amino acid sequence which has at least 91% identity with the amino acid sequence set 
forth in SEQ ID NO: 2. Preferably, the toxin comprises an amino acid sequence which 
has at least 95% identity with the amino acid sequence set forth in SEQ ID NO: 2. 
More preferably, the toxin comprises an amino acid sequence which has at least 99% 
identity with the ammo acid sequence set forth in SEQ ID NO: 2. Most preferably, the 



29 



wo 03/075655 



# 



PCTAJS03/04735 



toxin comprises the amino acid sequence set forth in SEQ ID NO: 2, SEQ ID NO: 1 1, 
orSEQIDNO: 32. 

[001 10] The toxins of the present invention have insect control activity when tested 
against insect pests in bioassays. In another preferred embodiment, the toxins of the 
invention are active against lepidopteran insects, preferably against Ostrinia mbilalis 
(European com borer), Plutella xylostella (diamondback moth), Spodopterafrugiperda 
(fall armyworm), Agrotis ipsilon (black cutworm), Helicaverpa zea (com earworm), 
Heliothis virescens (tobacco budworm), Spodoptera exigiia (beet armyworm), 
Pectinophora gossypiella (pink boll worm), Trichoplusia ni (cabbage looper), 
Cochyles hospes (banded sunflower moth), and Homoeosoma electellum (sunflower 
head moth). The insect controlling properties of the insecticidal toxins of the invention 
are further illustrated in Examples 6, 8, 9 and 13 . 

[00111] The present invention also encompasses hybrid toxins which are active against 
insects, wherein the hybrid toxins are encoded by nucleic acid molecules comprising a 
nucleotide sequence that: (a) has a compliment that hybridizes to nucleotides 1981- 
2367 of SEQ ID NO: 1 in 7% sodium dodecyl sulfate (SDS), 0.5 MNaP04, 1 mM 
EDTA at 50^ C. with washing in O.IXSSC, 0.1% SDS at 65°C.; or (b) is isocoding 
with the nucleotide sequence of (a); or (c) comprises a consecutive 20 base pair 
nucleotide portion identical in sequence to a consecutive 20 base pair nucleotide 
portion of a nucleotide sequence of (a) or (b), wherein expression of the nucleic acid 
molecule results in insect control activity. In a preferred embodiment, the hybrid toxin 
is encoded by the approximately 2.4 kb DNA fragment comprised in pNOV3912, 
deposited inthe£ coU strain DH5a designated NRRL accession B-30551, whose 
expression results in an insecticidal hybrid toxm. Specifically exemplified herein is a 
hybrid toxin that is encoded by the nucleotide sequence set forth in SEQ ID NO: 10. 
When expressed in a heterologous host, the nucleic acid molecule of SEQ ID NO: 10 
results in insect control activity against Ostrinia mbilalis (European com borer), 
Plutella xylostella (diamondback moth), Spodoptera frupperda (fall armyworm), 
Agrotis ipsilon (black cutworm), Helicaverpa zea (com earworm), Heliothis virescem 
(tobacco budworm), Spodoptera exigua (beet armyworm), Pectinophora gossypiella 
(pink boll worm), Trichoplusia ni (cabbage looper), Cochyles hospes (banded 
sunflower moth), and Homoeosoma electellum (sunflower head moth). The insect 
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controlling properties of the exemplified hybrid toxm of the invention is further 
illustrated in Example 9. 

[00112] The present mvention also encompasses hybrid toxms active against insects 
that comprise a carboxy-terminal region of a Vip3 toxin jomed in the amino to carboxy 
direction to an amino-terminal region of a different \^p3 toxm, wherein the carboxy- 
terminal region comprises an amino acid sequence which has at least 75% identity, 
preferably at least 85% identity, more preferably at least 95% identity, most preferably 
at least 99% identity, with amino acids 661-788 of SEQ ID NO: 2; and wherein the 
amino-terminal region has at least 75% identity, preferably at least 85% identity, more 
preferably at least 95% identity, most preferably at least 99% identity, with amino 
acids 1-660 of SEQ ID NO: 5. In a preferred embodiment, the carboxy-terminal region 
comprises amino acids 661-788 of SEQ ID NO: 2, and the amino-terminal region 
comprises amino acids 1-660 of SEQ ID NO: 65 In a more preferred embodiment, the 
hybrid toxin comprises amino acids 1-788 of SEQ ID NO: 1 1. 

[00113] In further embodiments, the nucleotide sequences of the invention can be 
modified by incorporation of random mutations in a technique known as in vitro 
recombination or DNA shuffling. This technique is described in Stemmer et al. Nature 
370:389-391 (1994) and U.S. Patent 5,605,793, which are incorporated herein by 
reference. Millions of mutant copies of a nucleotide sequence are produced based on 
an original nucleotide sequence of this invention and variants with improved 
properties, such as increased insecticidal activity, enhanced stability, or different 
specificity or range of target insect pests are recovered. The method encompasses 
forming a mutagenized double-stranded polynucleotide from a template double- 
stranded polynucleotide comprising a nucleotide sequence of this invention, wherein 
the template double-stranded polynucleotide has been cleaved into double-stranded- 
random fragments of a desired size, and comprises the steps of adding to the resultant 
population of double-stranded random fragments one or more single or double- 
stranded oligonucleotides, wherein said oligonucleotides comprise an area of identity 
and an area of heterology to the double-stranded template polynucleotide; denaturing 
the resultant mixture of double-stranded random fragments and oligonucleotides into 
single-stranded fragments; incubating the resultant population of single-stranded 
fiBgments with a polymerase under conditions which resuh in the annealing of said 
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single- stranded fragments at said areas of identity to form pairs of annealed 
fragments, said areas of identity being sufiBcient for one member of a pair to prime 
replication of the other, thereby forming a mutagenized double-stranded 
polynucleotide; and repeating the second and third steps for at least two frirther cycles, 
wherein the resultant mixture in the second step of a fluther cycle includes the 
mutagenized double-stranded polynucleotide from the third step of the previous cycle, 
and the fiirther cycle forms a forther mutagenized double-stranded polynucleotide. In a 
preferred embodiment, the concentration of a single species of double- stranded 
random fragment in the population of double-stranded random fragments is less than 
1% by weight of the total DNA. In a further preferred embodiment, the template 
double-stranded polynucleotide comprises at least about 100 species of 
polynucleotides. In another preferred embodiment, the size of the double-stranded 
random fragments is from about 5 bp to 5 kb. In a fiirther preferred embodiment, the 
fourth step of the method comprises repeating the second and the third steps for at least 
10 cycles. 

Expression of the Nucleotide Sequences in Heterologous Mcrobial Hosts 

[00114] As biological insect control agents, the insecticidal toxins are produced by 
expression of the nucleotide sequences in heterologous host cells capable of expressing 
the nucleotide sequences. In a first embodiment, B, thuringiemis cells comprising 
modifications of a nucleotide sequence of this invention are made. Such modifications 
encompass mutations or deletions of existing regulatory elements, thus leading to 
altered exprej^sion of the nucleotide sequence, or the incorporation of new regulatory 
elements controlling the expression of the nucleotide sequence. In another 
embodiment, additional copies of one or more of the nucleotide sequences are added to 
Bacillus thuringieftsis cells either by insertion into the chromosome or by introduction 
of extrachromosomally replicating molecules containing the nucleotide sequences. 

[00115] In another embodiment, at least one of the nucleotide sequences of the 
invention is inserted into an appropriate expression cassette, comprising a promoter 
and termination signals. Expression of the nucleotide sequence is constitutive, or an 
inducible promoter responding to various types of stimuli to initiate transcription is 

32 



wo 03/075655 



PCT/US03TO735 



used. In a preferred embodiment, the cell in which the toxin is expressed is a 
microorganism, such as a virus, a bacteria, or a fongus. In a preferred embodiment, a 
virus, such as a baculovirus^ contains a nucleotide sequence of the invention m its 
genome and e?qpresses large amounts of the corresponding insecticidal toxin after 
infection of appropriate eukaiyotic cells that are suitable for virus replication and 
e?q)ression of the nucleotide sequence. The msecticidal toxin thus produced is used as 
an insecticidal agent. Alternatively, baculoviruses engineered to include the nucleotide 
sequence are used to infect insects in vivo and kill them either by expression of the 
insecticidal toxin or by a combination of vu-al infection and expression of the 
insecticidal toxin. 

[00116] Bacterial cells are also hosts for the expression of the nucleotide sequences of 
the invention. In a preferred embodiment, non-pathogenic symbiotic bacteria, which 
are able to live and replicate within plant tissues, so-called endophytes, or non- 
pathogenic symbiotic bacteria, which are capable of colonizing the phyllosphere or the 
rhizosphere, so-called epiphytes, are used. Such bacteria include bacteria of the genera 
Agrobacteriwn, Alcaligenes, Azospirillum, Azotobacter, Bacillus, Clavibacter, 
Enterobacier, Erwima, Flavobacter, Klebsiella, Psetidomofjos, Rhizobium, Serratia, 
Streptomyces mdXaftthomoiias. Symbiotic fiingi, such as Trichoderma and 
Gliocladiian are also possible hosts for e?q)ression of the inventive nucleotide 
sequences for the same purpose. 

[00117] Techniques for tiiese genetic manipulations are specific for the different 

available hosts and are known in the art. For example, the expression vectors pKK223- 
3 and pKK223-2 can be used to express heterologous genes mE. coli, either m 
transcriptional ortranslational fusion, behind the tac or trc promoter. For the 
expression of operons encoding multiple OKFs, the simplest procedure is to insert the 
operon into a vector such as pKK223- 3 in transcriptional fusion, allowing the cognate 
ribosome binding site of the heterologous genes to be used. Techniques for 
overexpression in gram-positive species such as Bacillus are also known in the art and 
can be used in the context of this invention (Quax et al. In:Industrial 
Microorganisms:Basic and Applied Molecular Genetics, Eds. Baltz et al., American 
Society for Microbiology, Washington (1993)). Alternate systems for overexpression 
rely for example, on yeast vectors and include the use of Pichia, Saccharomyces and 
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Kluyveromyces (Sreekrishna, In:Industrial microorganisms:basic and applied 
molecular genetics, Baltz, Hegeman, and Skatmd eds., American Sodety for 
Microbiology, Washington (1993); Dequin & Barre, Biotechnology L2: 173- 177 
(1994); van den Berg et al.. Biotechnology 8:135-139 (1990)). 

Plant transformation 

[00118] In a particularly preferred embodiment, at least one of the insecticidal toxins of 
the invention is expressed in a higher organism, e.g., a plant. In this case, transgenic 
plants expressing eflFective amounts of the toxins protect themselves from insect pests. 
When the insect starts feeding on such a transgenic plant, it also ingests the expressed 
toxins. This will deter the insect from further biting into the plant tissue or may even 
harm or kill the insect. A nucleotide sequence of the present invention is inserted into 
an expression cassette, which is then preferably stably integrated in the genome of said 
plant. In another preferred embodiment, the nucleotide sequence is included in a non- 
pathogenic self- replicating virus. Plants transformed in accordance with the present 
invention may be monocots or dicots and include, but are not limited to, maize, wheat, 
barley, rye, sweet potato, bean, pea, chicory, lettuce, cabbage, cauliflower, broccoli, 
turnip, radish, spinach, asparagus, onion, garlic, pepper, celery, squash, pumpkin, 
hemp, zucchini, apple, pear, quince, melon, plum, cherry, peach, nectarine, apricot, 
strawberry, grape, raspberry, blackberry, pineapple, avocado, papaya, mango, banana, 
soybean, tomato, sorghum, sugarcane, sugar beet, sunflower, rapeseed, clover, 
tobacco, carrot, cotton, alfalfa, rice, potato, eggplant, cucumber, Arabidopsis, and 
woody plants such as coniferous and deciduous trees. 

[00119] Once a desired nucleotide sequence has been transformed into a particular plant 
species, it may be propagated in that species or moved into other varieties of the same 
species, particularly including commercial varieties, using traditional breeding 
techniques. 

[00120] A nucleotide sequence of this invention is preferably expressed in transgenic 
plants, thus causing the biosynthesis of the corresponding toxin in the transgenic 
plants. In this way, transgenic plants with enhanced resistance to insects are generated. 
For their expression in transgenic plants, the nucleotide sequences of the invention 



34 



wo 03/075655 



PCT/US03/04735 



may require modification and optimization. Although in many cases genes from 
microbial organisms can be expressed in plants at high levels without modification, 
low expression in transgenic plants may result from microbial nucleotide sequences 
having codons that are not preferred in plants. It is known in the art that all organisms 
have specific preferences for codon usage, and the codons of the nucleotide sequences 
described in this invention can be changed to conform with plant preferences, while 
maintaining the amino acids encoded thereby. Furthermore, high expression in plants 
is best achieved from coding sequences that have at least about 35% GC content, 
preferably more than about 45%, more preferably more than about 50%, and most 
preferably more than about 60%. Microbial nucleotide sequences that have low GC 
contents may express poorly in plants due to the existence of ATTTA motifs that may 
destabilize messages, and AATAAA motifs that may cause inappropriate 
polyadenylation. Although preferred gene sequences may be adequately expressed in 
both monocotyledonous and dicotyledonous plant species, sequences can be modified 
to account for the specific codon preferences and GC content preferences of 
monocotyledons or dicotyledons as these preferences have been shown to differ 
(Murray et al Nucl Acids Res. 17:477-498 (1989)). In addition, the nucleotide 
sequences are screened for the existence of illegitimate splice sites that may cause 
message truncation. All changes required to be made within the nucleotide sequences 
such as those described above are made using well known techniques of site directed 
mutagenesis, PCR, and synthetic gene construction using the methods described in the 
published patent applications EP 0 385 962 (to Monsanto), EP 0 359 472 (to Lubrizol, 
and WO 93/07278 (to Ciba-Geigy). 
[00121] In one embodiment of the invention synthetic genes are made according to the 
procedure disclosed in U.S. Patent 5,625,136, herein incorporated by reference, la this 
procedure, maize preferred codons, i.e., the single codon that most firequently encodes 
that amino acid in maize, are used. The maize preferred codon for a particular amino 
acid can be derived, for example, fi"om known gene sequences fi-om maize. Maize 
codon usage for 28 genes fi-om maize plants is found in Murray et al. Nucleic Acids 
Research 17:477-498 (1989), the disclosure of which is incorporated herein by 
reference. Specifically exemplified synthetic sequences of the present invention made 
with maize optimized codons are set forth in SEQ ID NO: 3 and SEQ ID NO: 33. 
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[00122] In this manner, the nucleotide sequences can be optimized for expression in 
any plant. It is recognized that all or any part of the gene sequence may be optimized 
or synthetic. That is, synthetic or partially optimized sequences may also be used. 

[00123] For efficient initiation of translation, sequences adjacent to the initiating 
methionine may require modification. For example, they can be modified by the 
inclusion of sequences known to be effective in plants. Joshi has suggested an 
appropriate consensus for plants (NAR 15:6643-6653 (1987)) and Clonetech suggests 
a fiuther consensus translation initiator (1993/1994 catalog, page 210). These 
consensuses are suitable for use vntii the nucleotide sequences of this invention. The 
sequences are incorporated into constructions comprising the nucleotide sequences, up 
to and including the ATG (whilst leaving the second amino acid unmodified), or 
alternatively up to and including the GTC subsequent to the ATG (with the possibility 
of modifying the second amino acid of the transgene). 

[00124] The novel vip3 toxin genes of the present invention, either as their native 
sequence or as optimized synthetic sequences as described above, can be operably 
fused to a variety of promoters for expression in plants including constitutive, 
inducible, temporally regulated, developmentally regulated, chemically regulated, 
tissue-preferred and tissue-specific promoters to prepare recombinant DNA molecules, 
i.e., chimeric genes. The choice of promoter will vary depending on the temporal and 
spatial requirements for expression, and also depending on the target species. Thus, 
expression of the nucleotide sequences of this invention in leaves, in stalks or stems, in 
ears, in inflorescences (e.g. spikes, panicles, cobs, etc.), in roots, and/or seedlings is 
preferred. In many cases, however, protection against more than one type of insect pest 
is sought, and thus expression in multiple tissues is desirable. Although many 
promoters from dicotyledons have been shown to be operational in monocotyledons 
and vice versa, ideally dicotyledonous promoters are selected for expression in 
dicotyledons, and monocotyledonous promoters for expression in monocotyledons. 
However, there is no restriction to the provenance of selected promoters; it is sufficient 
that they are operational in driving the expression of the nucleotide sequences in the 
desired cell. 

[00125] Preferred constitutive promoters include the CaMV 35S and 19S promoters 
(Fraley etaL, U.S. Pat. No. 5,352,605 issued Oct. 4, 1994). An additionally preferred 
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promoter is derived from any one of several of the actin genes, which are expressed in 
most cell types. The promote expression cassettes described by McEhroy et al (Mol. 
Gen. Genet. 231: lSO-160 (1991)) can be easily modified for the expression of the 
novel toxin gene and are particularly suitable for use in monocotyledonous hosts. 

[00126] Yet another preferred constitutive promoter is derived from ubiquitin, wluch is 
another gene product known to accumulate in many cell types. A ubiquitin promoter 
has been cloned from several species for use in transgenic plants, for example, 
sunflower (Binet et al, 1991. Plant Science 79: 87-94), maize (Christensen et al, 
1989. Plant Molec. Biol. 12: 619-632), and arabidopsis (Nonis et al. 1993. Plant 
Molec. Biol. 21 :895-906). The maize ubiquitin promoter has been developed m 
transgenic monocot systems and its sequence and vectors constructed for monocot 
transformation are disclosed in the patent publication EP 0 342 926. The ubiquitin 
promoter is suitable for the expression of the novel toxin gene m transgenic plants, 
especially monocotyledons. 

[00127] Tissue-specific or tissue-preferential promoters useful for the expression of the 
novel toxin genes of the invention in plants, particularly maize, are those that direct 
expression in root, pith, leaf or pollen. Such promoters are disclosed in WO 93/07278, 
herein incorporated by reference in its entirety. Other tissue specific promoters useful 
in the present invention include the cotton rubisco promoter disclosed in US Patent 
6,040,504; the rice sucrose synthase promoter disclosed in US Patent 5,604,121; and 
the cestrum yellow leaf curling virus promoter disclosed in WO 01/73087, all 
incorporated by reference. Chemically inducible promoters useful for directing the 
expression of the novel toxin gene in plants are disclosed in US Patent 5,614,395 
herein incorporated by reference in its entirety. 

[00128] The nucleotide sequences of this invention can also be expressed under the 
regulation of promoters that are chemically regulated. This enables the Vip3 toxins to 
be synthesized only when the crop plants are treated with the inducing chemicals. 
Preferred technology for chemical induction of gene expression is detailed in the 
published application EP 0 332 104 (to Ciba- Geigy) and U.S. Patent 5,614,395. A 
preferred promoter for chemical induction is the tobacco PR-la promoter 

[00129] A preferred category of promoters is that which is wound inducible. Numerous 
promoters have been described which are expressed at wound sites and also at the sites 
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of phytopathogen infection. Ideally, such a promoter should only be active locally at 
the sites of infection, and in this way the insecticidal toxins only accumulate in cells 
that need to synthesize the insecticidal toxms to kill the mvading insect pest. Preferred 
promoters of this kind include those described by Stanford et al. Mol. Gen. Genet. 
215:200-208 (1989), Xu etal Plant Molec. Biol. 22:573-588 (1993), Logemann et al. 
Plant Cell 1:151-158 (1989), Rohraieier & Lehle, Plant Molec. Biol. 22:783-792 
(1993), Firek ei al Plant Molec. Biol. 22:129-142 (1993), and Warner et al Plant J. 
3:191-201 (1993). 

[00130] Preferred tissue specific expression patterns include green tissue specific, root 
specific, stem specific, and flower specific. Promoters suitable for expression in green 
tissue include many that regulate genes involved in photosynthesis and many of these 
have been cloned fi-om both monocotyledons and dicotyledons. A preferred promoter 
is the maize PEPC promoter from the phosphoenol carboxylase gene (Hudspeth & 
Grula, Plant Molec. Biol. 12:579-589 (1989)). A preferred promoter for root specific 

• expression is that described by de Framond (FEBS 290: 103-106 (1991); EP 0 452 269 
to Ciba- Geigy). A preferred stem specific promoter is that described in U.S. Patent 
5,625,136 (to Ciba-Geigy) and which drives expression of the maize trpA gene. 

[00131] Further preferred embodiments are transgenic plants expressing the nucleotide 
sequences in a wound-inducible or pathogen infection-inducible manner. 

[00132] In addition to the selection of a suitable promoter, constructions for expression 
of an insecticidal toxin in plants require an appropriate transcription terminator to be 
attached downstream of the heterologous nucleotide sequence. Several such 
terminators are available and known in the art (e.g. tml from CaMV, E9 from rbcS). 
Any available terminator known to function in plants can be used in the context of this 
invention. 

[00133] Numerous other sequences can be incorporated into expression cassettes 
described in this invention. These include sequences that have been shown to enhance 
expression such as intron sequences (e.g. from Adhl and bronzel) and viral leader 
sequences (e.g. from TMV, MCMV and AMV). 

[00134] It may be preferable to target expression of the nucleotide sequences of the 
present invention to diflFerent cellular localizations in the plant. In some cases, 
localization m the cytosol may be desirable, whereas m other cases, localization in 
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some subcellular organelle may be preferred. Subcellular localization of transgene- 
encoded enzymes is undertaken using techniques well known in the art. Typically, the 
DNA encoding the target peptide from a known oiganelle-targeted gene product is 
manipulated and fiised upstream of the nucleotide sequence. Many such target 
sequences are known for the chloroplast and their functioning in heterologous 
constructions has been shown. The e;q)ression of the nucleotide sequences of the 
present invention is also targeted to the endoplasmic reticulum or to the vacuoles of 
the host cells. Techniques to achieve this are well known in the art. 

[00135] Numerous transformation vectors available for plant transformation are known 
to those of ordinary skill in the plant transformation art, and the nucleic acid molecules 
of Ae invention can be used in conjunction with any such vectors. The selection of 
vector will depend upon the preferred transformation technique and the target plant 
species for transformation. For certain target species, different antibiotic or herbicide 
selection markers may be preferred. Selection markers used routinely in 
transformation include the rptllgene, which confers resistance to kanamycin and 
related antibiotics (Messing & Vierra, 1982. Gene 19: 259-268; and Bevan etaL, 
1983. Nature 304: 184-187), the bar gene, which confers resistance to the herbicide 
phosphinothricin (White et al, 1990. Nucl. Acids Res 18: 1062, and Spencer et al, 
1990. Theor. Appl. Genet 79: 625-631), the hph gene, which confers resistance to the 
antibiotic hygromycin (Blochinger & Diggelmann, Mol Cell Biol 4: 2929-2931), and 
the dhfr gene, which confers resistance to methatrexate (Bourouis et al, 1983. EMBO 
J. 2(7): 1099-1 104), the EPSPS gene, which confers resistance to glyphosate (U.S. 
Patent Nos. 4,940,935.and 5,188,642), and the mannose-6-phosphate isomerase gene, 
which provides the ability to metabolize mannose (U.S. Patent Nos. 5,767,378 and 
5,994,629). The choice of selectable marker is not, however, critical to the invention. 

[00136] In another preferred embodiment, a nucleotide sequence of the present 

invention is directly transformed into the plastid genome. A major advantage of plastid 
transformation is that plastids are generally capable of expressing bacterial genes 
without substantial modification, and plastids are capable of expressing multiple open 
reading frames under control of a single promoter. Plastid transformation technology is 
extensively described in U.S. Patent Nos. 5,451,513, 5,545,817, and 5,545,818, in PCT 
application no. WO 95/16783, and in McBride et aL (1994) Proc. Nati. Acad. Sci. 
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USA 91, 7301-7305. The basic technique for chloroplast transformation involves 
introducing regions of cloned plastid DNA flanking a selectable marker together with 
the gene of interest into a suitable target tissue, e.g., using biolistics or protoplast 
transformation (e.g., calcium chloride or PEG mediated transformation). The 1 to 1.5 
kb flanking regions, termed targeting sequences, fecilitate homologous recombination 
with the plastid genome and thus allow the replacement or modification of specific 
regions of the plastome. Initially, point mutations in the chloroplast 16S rRNA and 
rpsl2 genes conferring resistance to spectinomycin and/or streptomycin are utilized as 
selectable markers for transformation (Svab, Z., Hajdukiewicz, P., andMaliga, P. 
(1990) Proc. Nati. Acad. Sci. USA 87, 8526-8530; Staub, J. M., and Maliga, P. (1992) 
Plant Cell 4, 39-45). This resulted in stable homoplasmic transformants at a frequency 
of approximately one per 100 bombardments of target leaves. The presence of cloning 
sites between these markers allowed creation of a plastid targeting vector for 
introduction of foreign genes (Staub, J.M., and Maliga, P. (1993) EMBO J. 12, 601- 
606). Substantial increases in transformation frequency are obtained by replacement of 
the recessive rRNA or r-protein antibiotic resistance genes with a dommant selectable 
marker, the bacterial aadA gene encoding the spectinomycin-cletoxifying enzyme 
aminoglycoside- 3'- adenyltransf erase (Svab, Z., and Maliga, P. (1993) Proc. Natl. 
Acad. Sci. USA 90, 913-917). Previously, this marker had been used successfully for 
high-frequency transformation of the plastid genome of the green alga 
Chlamydomonas reinhardtii (Goldschmidt- Clermont, M. (1991) Nucl. Acids Res, 
19:4083r4089). Other selectable markers useful for plastid transformation are known 
in the art and encompassed within the scope of the invention. Typically, approximately 
15-20 cell division cycles following transformation are required to reach a 
homoplastidic state. Plastid expression, in which genes are inserted by homologous 
recombination into all of the several thousand copies of the circular plastid genome 
present in each plant cell, takes advantage of the enormous copy number advantage 
over nuclear- expressed genes to permit expression levels that can readily exceed 10% 
of the total soluble plant protein. In a preferred embodunent, a nucleotide sequence of 
the present invention is inserted into a plastid-targeting vector and transformed into the 
plastid genome of a desired plant host. Plants homoplastic for plastid genomes 
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containing a nucleotide sequence of the present invention are obtained, and are 
preferentially capable of high expression of the nucleotide sequence. 

Combinations of Insect Control Principles 

[00137] The pesticidal toxins of the invention can be used in combination with Bt 5- 
endotoxins or other pesticidal principles to increase pest target range. Furthermore, the 
use of the pesticidal toxins of the invention in combination with Bt 5-endotoxins or 
other pesticidal principles of a distinct nature has particular utility for the prevention 
and/or management of insect resistance, 

[00138] The various insecticidal crystal proteins from Bacillus thuringiensis have been 
classified based upon their spectrum of activity and sequence similarity. The 
classification put forth by Hofte and Whiteley, Microbiol. Rev. 53: 242-255 (1989) 
placed the then known insecticidal crystal proteins into four major classes. Generally, 
the major classes are defined by the spectrum of activity, with the Cryl proteins active 
against Lepidoptera, Cry2 proteins active against both Lepidoptera and Diptera, Cry3 
proteins active against Coleoptera, and Cry4 proteins active against Diptera. 

[00139] Within each major class, the 5-endotoxins are grouped according to sequence 
similarity. The Cryl protems are typically produced as 130-140 kDa protoxin proteins 
that are proteolytically cleaved to produce active toxins that are about 60-70 kDa. The 
active portion of the 5-endotoxin resides in the NH2 -terminal portion of the full- 
length molecule. Hofte and Whiteley, supra, classified the then known Cryl proteins 
into six groups, 1 Aa, lAb, 1 Ac, IB, IC, and ID. Since then, proteins classified as 
CrylEa, CrylFa, Cry9A, Cry9C and Cry9B, as well as others, have also been 
characterized. 

[00140] The spectrum of insecticidal activity of an individual 5-endotoxin from 
Bacillus thuringiensis tends to be quite narrow, with a given 5-endotoxin being active 
against only a few insects. Specificity is the result of the efiiciency of the various steps 
involved in producing an active toxin protein and its subsequent ability to interact with 
the epithelial cells in the insect digestive tract. In one preferred embodiment, 
expression of the nucleic acid molecules of the invention in transgenic plants is 
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accompanied by the esqpression of one or more Bt 8-endotoxins. Particularly preferred 
Bt 5-endotoxins are those disclosed in U.S. Patent 5,625,136, herein incorporated by 
reference. 

[00141] It is well known that many 5-endotoxin proteins from Bacillus thuringiensis are 
actually expressed as protoxins. These protoxins are solubilized in the alkalme 
environment of the insect gut and are proteolytically converted by proteases into a 
toxic core fragment (Hofte and Whiteley, Microbiol. Rev. 53: 242-255 (1989)). For 5- 
endotoxin proteins of the Cryl class, the toxic core fragment is localized in theN- 
terminal half of the protoxin. It is within the scope of the present invention that genes 
encoding either the full-length protoxin form or the tnmcated toxic core fragment of 
the novel toxin proteins can be used in plant transformation vectors to confer 
insecticidal properties upon the host plant. 

[00142] Other insecticidal prmciples include protease inhibitors (both serine and 
cysteine types), lectins, a-amylase, peroxidase and cholesterol oxidase. Other Vip 
genes, such as v//?! A(a) and v//?2A(a) as disclosed in U.S. Pat. No, 5,849,870 and 
herein incorporated by reference, are also usefiil in the present invention. 

[00143] This co-expression of more than one insecticidal principle in the same 
transgenic plant can be achieved by genetically engineering a plant to contain and 
express all the genes necessary. Alternatively, a plant. Parent 1, can be genetically 
en^eered for the expression of genes of the present invention. A second plant. Parent 
2, can be genetically engineered for the expression of a supplemental insect control 
principle. By crossing Parent 1 with Parent 2, progeny plants are obtained which 
express all the genes introduced into Parents 1 and 2. 

[00144] The present invention fiuther encompasses variants of the disclosed nucleic 
acid molecules. Naturally occurring variant sequences can be identified and/or isolated 
with the use of well-known molecular biology techmques, as, for Kcample, with PGR 
and hybridization techniques as outlined below. 

[00145] Variant vip3 nucleotide sequences include synthetically derived nucleotide 
sequences, such as those generated, for example, by using site-directed mutagenesis or 
those made by whole domain swaps, but which still exhibit pesticidal activity. 
Methods for mutagenesis and nucleotide sequence alterations are well known m the 
art. See, for example, Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488-492; Kunkel 
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et al. (1987) Methods in Enzymol. 154:367-382; U.S. Pat. No. 4,873,192; Walker and 
Gaastra, eds. (1983) Techniques in Molecular Biology (MadMQUan Publishing 
Company, New York) and the references cited thereia Generally, a nucleotide 
sequence of the invention will have at least 80%, preferably 85%, 90%, 95%, up to 
98% or more sequence identity to its respective reference yip3 nucleotide sequence, 
and have pesticidal activity. 
[00146] Variant vip3 nucleotide sequences also encompass sequences derived from a 
mutagenic and recombinogenic procedure such as DNA shuffling. With such a 
procedure, one or more different vip3 sequences of the present invention, for example, 
without limitation, v/p3C(a), v/p3C(b), v//?3A-C, and v7p3C-12168 can be recombined 
together or with other vip3 or related sequences, for example, and without limitation, 
v(p3 A (SEQ ID NO: 4), vzp3B (SEQ ID NO: 6), and vip3Z (SEQ ID NO: 8), to create 
new vip3 nucleic acid molecules encoding Vip3 toxins possessing the desired 
properties. In this manner, libraries of recombinant vip3 polynucleotides are generated 
from a population of sequence related vip3 polynucleotides comprising sequence 
regions that have substantial sequence identity and can be homologously recombined 
in vitro or in vivo. Strategies for such DNA shuffling are known in the art. See, for 
example, Stemmer (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751; Stemmer 
(1994) Nature 370:389-391; Crameri et al, (1997) Nature Biotech. 15:436-438; Moore 
et al. (1997) J. Mol. Biol. 272:336-347; Zhang et al. (1997) Proc. Nati. Acad. Sci. USA 
94:4504-4509; Crameri et al. (1998) Nature 391:288-291; International Patent 
Application WO 99/57128, and U.S. Pat. Nos. 5,605,793, 5,837,458 and 6,335,179. 

[00147] Mutagenesis methods as disclosed herein can be combined with high- 
throughput, screening methods to detect the pesticidal activity of cloned, mut^enized 
Vip3 polypeptides in host cells. Mutagenized DNA molecules that encode active Vip3 
polypeptides (e.g., secreted and detected by antibodies; or insecticidal in an insect 
bioassay) can be recovered from the host cells and rapidly sequenced using standard 
art procedures. These methods allow the rapid determination of the importance of 
individual amino acid residues in a Vip3 polypeptide of interest, and can be applied to 
polypeptides of unknown structure. 

[00148] The libraries of recombinant vip3 genes that are produced using DNA shuffling 
methods are screened to identify those that exhibit improved properties for use in 
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protecting plants against pests. Included among properties for which DNA shuffling is 
useful for obtaining improved vip3 pest resistance genes are increased potency against 
a target pest, increased target pest range, decreased susceptibility to development of 
resistance by pests, increased expression level, increased resistance to protease 
degradation, increased stability in environmental conditions, and reduced toxicity to a 
host plant. By using an appropriate screening strategy, one can simultaneously or 
sequentially obt^ vip3 genes that are optimized for more than one property. 
[00149] DNA shufQing is useful for obtaining vip3 pest resistance genes that encode 
toxins that exhibit enhanced potency against a target pest. Once the shuffling is 
completed, the resulting library of shuffled vip3 genes is screened to identify those that 
exhibit enhanced pesticidal activity. One way of performing this screening is to clone 
the protein-coding region of the shuffled vip3 genes into an expression vector that is 
suitable for expressing the genes in a chosen host cell such as, for example, E. coli or a 
crystal minus stram Bacillus thuringiensis. One skilled in the art will recognize the 
advantages and disadvantages of using either of these two expression systems. For 
example. Bacillus thuringiensis would be more desirable in producing secreted Vip3 
proteins. If desired, clones can be subjected to a preliminary screen, for example, by 
immunoassay, to identify those that produce a Vip3 protein of the correct size. Those 
that are positive in the preliminary screen are then tested in a functional screen to 
identify shuffled vip3 genes that encode a toxin having the desired enhanced activity. 

[00150] A whole insect assay can be used for determining toxicity. In these assays, the 
Vip3 toxins expressed from the shuffled vip3 genes are placed on insect diet, for 
example, artificial diet or plant tissue, and consumed by the target insect. Those clones 
causing growth inhibition or mortality to the target insect can be tested in further 
bioassays to determine potency. Shuffled vip3 genes encoding toxins with enhanced 
potency can be identified as those that have a decreased EC50 (concentration of toxin 
necessary to reduce insect growth by 50%) and/or LC50 (concentration of toxin 
necessary to cause 50% mortality). 

[00151] In vitro assays can also be used for screening shuffled vip3 gene libraries. Such 
assays typically involve the use of cultured insect cells that are susceptible to Vip3 
toxins, and/or cells that express a receptor for the Vip3 toxins, either naturaUy or as a 
result of expression of a heterologous gene. Other in vitro assays can be used, for 
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example, detection of morphological changes in cells, dyes and labels usefiil for 
detecting cell death, or detection of the release of ATPase by cells. One example of a 
suitable in vitro assay using cultured insect cells for Vip3 toxicity is SQ (Spodoptera 
frugiperda) cells. Sf9 is highly sensitive to Vip3 toxins. When Vip3 toxins are mixed 
with S© cells, the cell membrane becomes highly permeable to small molecules. 
When a dye such as trypan blue is added to the cell suspension, those cells which are 
killed by the Vip3 toxin are stained blue. Thus, the cytotoxicity of the Vip3 toxin can 
be determined by image analysis. 
[00152] Additional in vitro assays involve the use of receptors for the Vip3 toxins. One 
such receptor is disclosed in US Patent 6,291,156, herein incorporated by reference. 
The Vip3 receptor protein can be immobilized on a receiving surface, for example, 
without Umitation, a 96-well plate or a nitrocellulose membrane, and exposed to clones 
comprising shuffled vip3 genes. Thus, shuffled vip3 genes that encode functional 
toxins can be identified on the basis of binding affinity to the Vip3 receptor. Further, 
the gene encoding the Vip3 receptor can be transformed into a non-Vip3 susceptible 
cell line, for example the Schneider 2 (S2) Dfosophila cell line, using methods known 
in the art (see for example, Clem and Miller, 1194, Mol. Cel. Biol. 14:5212-522). The 
transformed S2 cells can then be exposed to clones comprising shuffled vzp3 genes. 
Thus, shuffled vip3 genes that encode fimctional toxins can be identified on the basis 
of induction of cell death. 

EXAMPLES 

[00153] The invention will be further described by reference to the following detailed 
examples. These examples are provided for purposes of illustration only, and are not 
intended to be limiting unless otherwise specified. Standard recombinant DNA and 
molecular cloning techniques used here are well known in the art and are described by 
Ausubel (ed.). Current Protocols in Molecular Biology, John Wiley and Sons, Inc. 
(1994); J. Sambrook, etal,. Molecular Cloning: A Laboratory Mcamal 3dEd, Cold 
Spring Harbor, NY: Cold Spring Harbor Laboratory Press (2001); and by T.J. Silhavy, 
M.L. Herman, and L. W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY (1984). 
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Example 1 : Identification of Bt Isolates That Harbor Vip3 Homologous Proteins. 

[00154] Three sets of PCR primers, whose sequences are based on the vip3A g&ie 
(SEQ ID NO: S), were used in a PCR reaction to amplify fragments of possible 
homologous vip3 genes fi^om Bacillus thuringiefisis (Bt) isolates. The three primer sets 
used were: 

IF: 5'-ATGAACAAGAATAATACTAAATrAAGCACAAGAGCC-3' (SEQ ID NO: 12) 
' IR: 5'-CrCAACATAGAGGTAATnTAGGTAGATATACCCG-3' (SEQ ID NO: 13) 

p3: 5'-GATGATGGGGTGTATATGCCGTrAG-3* (SEQ ID NO: 14) 

p4: 5'-AATAAATrGTGAAATrCCTCCGTCC-3' (SEQ ID NO: 15) 

4F: 5'-AGTCAAAATGGAGATCAAGGTrGGGGAGATAAC-3 ' (SEQ ID NO: 1 6) 

4R: 5'-TrACITAATAGAGAGATCGTGGAAATGTACAATA-3' (SEQ ID NO: 17) 

[OOlSS] Three PCR products were expected if a Bt isolate comprised a gene identical to 
the vip3A gene (SEQ ID NO: 4). The size of the PCR product generated by primer sets 
IF/IR, p3/p4, and 4F/4R were 377 bp, 344 bp, and 419 bp, respectively. Isolates that 
produced only one or two PCR products, which indicated they may comprise a vip3 
gene with some sequence difference to vip3A, were subjected to further sequence 
analysis. 

Example 2: Cloning and Sequencing of PCR Products to Confirm Vip3 Homologous 
Sequences. 

[00156] Bt isolates identified in Example 1 as produdng one or two PCR products w^e 
subjected to PCR again with primer set IF/IR (SEQ ID NO: 12/SEQ ID NO: 13) as 
well as the following two primers: 
p5: 5'- AATGGAGATGAAGCrTGGGGAGAT-3* (SEQ ID NO: 18) 

p6: 5'-CGTGGAAATGTACAATAGGACCACC-3' (SEQ ID NO: 19) 

[00157] The PCR products were then cloned into a pCR2. 1-Topo (Invitrogen) vector 
and sequenced using standard art procedures. 
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[00158] Three Bi isolates were identified as comprising homologous vip3 genes, 
designated vip3C, with significant sequence differences to vip3A. These 5/ isolates 
were designated C536, C1674 and AB727. 

Example 3: PGR Cloning the Full-length vipSC Gene. 

[00159] The 3' end of the vip3C gene was obtained by PGR using total plasmid DNA 
isolated fi-om Bt strain G536 or G1674 as the template. The primers used were: 
Vip3CF4:5*.GTTrAGAAGATITrcAAACCATrAC.3' (SEQIDNO:20) 
T7: 5'-TTAATACGACrCACTATAGGG-3' (SEQ ID NO: 21) 

Primer T7 is a non-gene specific primer that recognizes the flanking nucleotide 
sequence 3' to the w>3G gene. 

[00160] The PGR products were cloned and sequenced using standard art procedures. 
The fimal fialMength vip3C gene was obtained by PGR using the two primers located at 
the 3' and 5' ends of vip3C: 
Vip3Cc: 5^TITATTrAATAGAAACGTTTrCAAATGATATATG-3' (SEQ ID NO: 22) 
Vip3Cn: 5'-CACCATGAACAAGAATAATACrAAATTAAGCACAAGAG-3'(SEQ JD NO: 23) 
[00161] Two fiill-length vi>3G genes were obtained. The v/p3C gene fi-om Bt isolate 
G536 was designated vi>3G(a), and the vip3C gene isolated firom C1674 was 
designated vip3C(\)). Vip3C(a) and wp3G(b) differ by one nucleotide at position 2213 
(See SEQ ID NO: 1), wherein vi>3G(a) comprises the nucleotide "a" at position 2213, 
thereby encoding the amino acid Glu at position 738 of SEQ ID NO: 2, and wherein 
v//?3C(b) comprises the nucleotide "g" at position 2213, thereby encoding a Gly at 
position 738 of SEQ ID NO: 2. 
[00162] The v/;p3C(a) and the v/>3C(b) genes were each cloned into pETlOl/D-Topo 
expression vectors and designated pNOV391 1 and pNOV3910, deposited in K coli 
DH5a cells, and given the accession numbers NRRL B-30552 and NRRL B-30553, 
respectively. 
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Example 4: Cosmid Cloning the Full-length viphZ Gene. 

[00163] Total DNA was isolated from AB727 by treating freshly grown cells 
resuspended in 100 mM Tris pH 8, 10 mMEDTA with 2 mg/ml lysozyme for 30 
minutes at ZTQ. Proteinase K was added to a final concentration of 100 |ag/ml in 1% 
SDS, 50mM EDTA, IM urea and incubated at 55°C. An equal volume of phenol- 
chloroform-isoamyl alcohol was added. The sample was gently mixed for 5 minutes 
and centrifuged at 3K, This was repeated twice. The aqueous phase was then mixed 
with 0.7 volumes isopropanol and centrifuged. The DNA pellet was washed three 
times with 70% ethanol and gently resuspended in 0.5X TE. 12 ^g of DNA were 
treated with 0.3 unit diSauiA per |j,g of DNA at 37^C in a volume of 100 ^il. Samples 
were taken at 2-niin intervals for 10 minutes. Then 1/10 volume lOX TE was added 
and samples were heated for 30 minutes at 65°C to inactivate the enzyme. The samples 
were subjected to electrophoresis to determine which fraction is m the 40-kb range and 
this sample was used in the ligation. 
[00164] SuperCos cosmid vector (Stratagene, La Jolla, C A) was prepared as described 
by the supplier utilizing the BaanHl cloning site. Prepared SuperCos at 100 ng/ml was 
ligated with the AB727 DNA previously digested with Sau3A at a ratio of 2:1 in a 5 ^1 
volume overnight at 6°C. The ligation mixture was packaged using Gigapack XL III 
(Stratagene) as described by the supplier. Packaged phages were infected into XL- 
IMR E. coli ceUs (Stratagene) as described by the suppUer. The cosmid library was 
plated on L-agar with 50 jig^ml kanamycin and incubated 16 hours at 37°C. 200 
colonies were picked and grown for screening for the presence of the vipSZ gene. 
[00165] The 200 cosmid clones were screened for the presence of the vip3Z gene by 
PGR using primer Vip3ZA: 5'-GGCATTTATGGATTTGCCACT(3GTATC-3' (SEQ 
ID NO: 28) and primer Vip3ZB: 5'-TCCnTGATACGCAGGTGTAATTTCAG-3' 
(SEQ ID NO: 29). 

[00166] One cosmid clone, designated 5g, was shown to comprise the vip3Z gene (SEQ 
ID NO; 8) encoding the Vip3Z protein (SEQ ID NO: 9). 
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Example 5. Maize Optimized vip3C Gene Construction 

[00167] A maize optimized v//?3C gene was made according to the procedure disclosed 
in US Patent 5,625,136, incorporated herin by reference. In this procedure, maize 
preferred codons, i. e., the single codon that most frequently encodes that amino acid 
in maize, are used. The maize preferred codon for a particular amino acid is derived 
from know gene sequences from maize. Maize codon usage for 28 genes from maize 
plants is found in Murray et al (1989, Nucleic Acids Res. 17:477-498). 

[00168] Synthetic v/>3C(a) and v//?3C(b) genes were made which encode the amino 
acid sequence depicted in SEQ ID NO: 2. At positions 2213 and 2214 of SEQ ID NO: 
3, the synthetic v//73C(a) gene comprises nucleotides "a" and "g", respectively, 
encoding the amino acid Glu at position 738 of SEQ ID NO: 2, and the synthetic 
V3fp3C(b) gene comprises nucleotides "g" and "a", respectively, encoding the amino 
acid Gly at position 738 of SEQ ID NO: 2. The synthetic v/p3C(a) and vj/73C(b) genes 
were separately cloned into pETlOl/D-Topo expression vectors and the resulting 
vectors designated pNOV3905, deposited mE, coli BL21 cells and given accession 
number NRRL B-30554, and pNOV3906, deposited in £. coli BL21 cells and given 
the accession number NRRL B-30555, 

Example 6: Bioassay of the Vip3C Protein. 

[00169] Black cutworm diet (BioServ, Frenchtown, NJ) was poured into 50 mm petri 
dishes. The diet was allowed to cool off and a 200 |il suspension of E. coli cells 
comprising pNOV3905, pNOV3906, pNOV3910 or pNOV3911 was pipetted onto the 
diet surface. The solution was uniformly spread with a bacterial loop so that the 
suspension covered the entke surface of the diet. The surface was allowed to dry 
thoroughly. First instar larvae of the lepidopteran species listed in the table below were 
placed on the diet with a fine tip brush. Each species was tested separately. Larval 
mortality, as well as the occurrence of feeding and growth inhibition, was recorded at 
3 days and 5 days after larval infestation of the diet. A sample containing E. coli cells 
without an expression vector acted as the negative control. Vip3 A protein can also be 
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tested in the same bioassay for comparative purposes or for this example, ^p3C data 
was compared to the known activity spectrum of Vip3A. 
[00170] Results are shown in Table 8. Insecticidal activity was observed five days after 
the plates were infested with insects. The data show that Vip3C(a) (firom pNOV391 1 
and pNOV3905) and Vip3C(b) (firom pNOV3910 and 3906) have a broader spectrum 
of activity than the Vip3 A toxin. Tests also indicated that the Vip3C toxin is not active 
against the environmental beneficial insect Danaus plexippus. 

Table 8. 



% Insect Mortality Activity 
Spectrum of 



Insect Tested 


Vip3C(a) 


Vip3C(b) 


VipSA" 


Agrotis ipsilon 


100 


100 


+ 


Helicoverpa zea 


75' 


75' 


+ 


Helioihis virescens 


80 


50 


+ 


Spodoptera exigua 


100 


100 


+ 


Spodoptera frugiperda 


70' 


70' 


+ 


Trichophsia ni 


100 


100 




Pectinophora gossypiella 


50' 


60' 


+ 


Cochylis hospes 


90 


90 


+ 


Homoeosoma electellum 


40' 


30' 


+ 


Ostrinia mbilalis 


100 


100 




Plutella xylostella 


100 


100 





^Surviving insects were observed to have severe feeding and growth inhibition. 
^'A indicates an insect species that is susceptible to Vip3 A. A indicates an 
insect species with little or no susceptibility to Vip3 A 

Example 7. Creation of Transgenic Maize Plants Comprising a vip3C Gene. 

[00171] Maize optimized vip2C (SEQ ID NO: 3) was chosen for transformation into 
maize plants. An expression cassette comprising the v//73C(a) sequence was transferred 
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to a suitable vector for Agrobacterium-mediated maize transformation. For this 
example, an expression cassette comprised, in addition to the vi)73C(a) gene, the maize 
ubiquitin promoter and the nos termmater which are known m the art, as well as the 
phosphomannose isomerase (PMI) gene for selection of transgenic lines (Negrotto et 
al (2000) Plant Cell Reports 19: 798-803). The resultmg vector was designated 
pNOV2149 (SEQ ID NO: 30). 

[00172] Transformation of immature maize embryos was performed essentially as 
described in Negrotto etal, 2000, Plant Cell Reports 19: 798-803. For this example, 
all media constituents were as described in Negrotto et al, supra. However, various 
media constituents known in the art may be substituted. 

[00173] Agrobacterium strain LBA4404 (pSBl) containing the plant transformation 
plasmid was grown on YEP (yeast extract (5 g/L), peptone (lOg/L), NaCl (5g/L), 15g/l 
agar, pH 6.8) solid medium for 2 - 4 days at 28°C. Approximately 0.8X 10^ 
Agrobacterium were suspended in LS-inf media supplemented with 100 ^lM As 
(Negrotto et al.X2000) Plant Cell Rep 19: 798-803). Bacteria were pre-induced in this 
medium for 30-60 minutes. 

[00174] Immature embryos from the A188 maize genotype were excised from 8-12 
day old ears into liquid LS-inf + 100 piM As. Embryos were rinsed once with fresh 
infection medium. Agrobacterium solution was then added and embryos were 
vortexed for 30 seconds and allowed to settle with the bacteria for 5 minutes. The 
embryos were then transferred scutellum side up to LSAs medium and cultured in the 
dark for two to three days. Subsequently, between 20 and 25 embryos per petri plate 
were transferred to LSDc medium supplemented with cefotaxime (250 mg/1) and silver 
nitrate (1.6 mg/1) and cultured in the dark for 28°C for 10 days. 

[00175] Immature embryos, producing embryogenic callus were transferred to 

LSD1M0.5S medium. The cultures were selected on this medium for 6 weeks with a 
subculture step at 3 weeks. Surviving calli were transferred to Regl medium 
supplemented with mannose. Following culturing m the light (16 hour light/ 8 hour 
dark regiment), green tissues were then transferred to Reg2 medium without growth 
regulators and incubated for 1-2 weeks. Plantlets were transferred to Magenta GA-7 
boxes (Magenta Corp, Chicago 111.) containing Reg3 medium and grown in the light. 
Afker 2-3 weeks, plants were tested for the presence of the PMI genes and the vi>3C(a) 



51 



wo 03/075655 



PCTAJS03/04735 



gene by PCR. Positive plants from the PGR assay were transferred to the greenhouse 
and tested for resistance to lepidopteran pests. 

Example 8. Analysis of Transgenic Maize Plants 

[00176] Plants were sampled as they are being transplanted from Magenta GA-7 boxes 
into soil. Sampling consisted of cutting two small pieces of leaf (ca. 2-4 cm long) 
and placing each in a small petri dish. Negative controls were either transgenic plants 
that were PCR negative for the v/p3C(a) gene from the same experiment, or from non- 
transgenic plants (of a similar size to test plants) that were being grown in the 
phytotron. 

[00177] Leaf samples from each plant were inoculated with either European com borer 
{Ostrifua mbilalis) or fell armyworm {Spodcpterafru^perda) by placing 10 first 
instar larvae onto each leaf piece. Petri dishes were then tightly sealed. 

[00178] At 3-4 days post inoculation, data were collected. The percent mortality of the 
larvae was calculated along with a visual damage rating of the leaf. Feeding damage 
was rated as high, moderate, low, or absent and given a numerical value of 3, 2, 1 or 0, 
respectively. 

[00179] Results shown in Table 9 indicate that transgenic maize plants conq)rismg the 
vz/73C(a) gene and expressing the Vip3C(a) protein, are insecticidal to European com 
borer (ECB) and fall armyworm (FAW). 



Table 9. 



Event 


Plant No. 


ECB 

Mortality 


Damage 

Rating 


FAW 

Mortality 


Damage 
Rating 


557 


12A 


80% 


2 


100% 


0 


557 


20B 


100% 


1 


100% 


0 


557 


8A 


80% 


1 


70% 


0 


557 


11A 


100% 


2 


100% 


0 


557 


16B 


95% 


2 


95% 


0 


557 


18B 


90% 


2 


100% 


0 


557 


148 


100% 




100% 


0 


556 


1A 


100% 




100% 


0 


556 


38 


80% 




100% 


0 


556 


4A 


95% 




100% 


0 


556 


13A 


100% 




100% 


0 


A188 


NEG 


0 


10 


0% 


10 
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Example 9, Hybrid Vip3 Toxins. 

[00180] Vip3C is toxic to Ostrinia nubikdis (European com borer) and Plutella 
xylostella (diamond back moth), whereas homologous Vip3 toxms, for example, 
VipS A(a), Vip3A(b), and Vip3A(c) are not. Vip3C and Vip3 A differ primarily in the 
C-terminal region of their respective amino acid sequences particularly in the region 
from amino acid 661 to amino acid 788 of SEQ ID NO: 2. In order to demonstrate that 
this C-terminal region of Vip3C is the portion of the Vip3C toxin that is responsible 
for the activity against European com borer and diamond back moth, a hybrid toxin 
comprising the C-terminal region of Vip3C, amino acid number 661 to amino acid 
number 788 of SEQ ID NO: 2, was joined in an amino to caiboxy direction with the 
N-terminal region, from amino acid number 1 to amino acid number 660 of SEQ ID 
NO: 5, of Vip3 A. This hybrid toxin was designated Vip3 A-C. 
[00181] A nucleic acid molecule encoding the Vip3 A-C hybrid toxin, was constmcted 

using two steps of PCR with the following primers: 
Vip3A-N: 3'-CACCATGAACAAGAATAATACTAAATrAAGCACAAGAG-3'(SEQ IDNO: 24) 
Vip3A2050: 5*-TAAAGTrATCrCCCCAAGCITCATCrCCA.3' (SEQ ID NO: 25) 

Vip3C-Cl: 5'-AATGGAGATGAAGCTrGGGGAGAT-3' (SEQ ID NO: 26) 

Vip3C-C2: 3*-TTrATITAATAGAAACGTnTCAAATGATATATG.3' (SEQ ID NO: 27) 
[00182] In the first PCR step primers Vip3 A-N (SEQ ID NO: 24) and Vip3 A2050 
(SEQ ID NO: 25) were used to generate an approximately 2,0 kb fragment of the 5' 
end of the vip3A gene, encoding the N-terminal region, and primers Vip3C-Cl (SEQ 
ID NO: 26) and Vip3C-C2 (SEQ ID NO: 27) were used to generate an approximately 
0.4 kb fragment of the 3' end of the v/>3C gene, encoding the C-terminal region. In the 
second PCR step, these two fragments were combined as the templates for primers 
Vip3 A-N (SEQ ID NO: 24) and Vip3C-C2 (SEQ ID NO: 27) to generate an 
approxunately 2.4 kb hybrid vi/73 A-vi/73C gene, designated v//?3A-C. 
[00183] A hybrid v/p3 A-v573C(b) gene was made, the sequence of which is set forth in 
SEQ ID NO: 10. The hybrid vip3 A-C gene was cloned into pETlOlD (Novagen), and 
the resulting vector designated pNOV3912, and transformed into E. coli DH5a for 
expression. This E, coli clone, (NRRL B-30551), was tested agmnst the insect species 
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listed in Table 10. The Vip3C protein was used as comparative controls. Data were 
compared to the known activity spectrum of Vip3A. 
[00184] The results shown in the Table 10 confirm that the C-terminal region of Vip3C, 
amino acid number 661 to amino acid number 788 of SEQ ID NO: 2, is sufficient to 
confer European com borer and diamond back moth activity on the hybrid toxin. 

Table 10 



% Insect Mortality Activity 
Spectrum of 



Insect Tested 


Vip3A-C 


VipSCCb)" 


VipSA" 


Agrotis ipsilon 


100 


100 


+ 


Helicoverpa zea 


100 


75" 


+ 


Heliothis virescens 


60 


50 


+ 


Spodoptera exigua 


80 


100 


+ 


Spodoptera frugiperda 


70* 


70' 


+ 


Trichoplusia ni 


80 


100 


+ 


Pectinophora gossypiella 


80 


60' 


+ 


Cockylis hospes 


100 


90 


+ 


Homoeosoma electeUum 


40'" 


30' 


+ 


Ostrinia mbilalis 


100 


100 




Plutellaxylostella 


100 


100 





'Surviving insects were observed to have severe feeding and growth inhibition. 
•Data firom Example 6. 

®A indicates an insect species that is susceptible to Vip3 A. A indicates an 
insect species with little or no susceptibility to Vip3 A. 

Example 10. In vitro Recombination of v//?3 Genes by DNA Shuffling 

[00185] One of the v/p3 genes of the present invention (SEQ ID NO: 1, 3, or 1 1) is 
amplified by PGR. The resulting DNA fi-agment is digested by DNasel treatment 
essentially as described in Stemmere/ a/., ™5 91: 10747-10751 (1994), and the 
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PCR primers are removed from the reaction mixture. A PGR reacdon is carried out 
without primers and is followed by a PCR reaction with the primers, both as described 
in Stammer et aL (1994). The resulting DNA fragments are cloned into pTRC99a 
(Pharmacia, Cat no: 27-5007-01) and transformed into Kcoli strain SASX38 by 
electroporation using the Biorad Gene Pulser and the manufacturei's conditions. The 
transformed bacteria are grown on medium overnight and screened for insecticidal 
activity. 

[00186] In a similar reaction, PCR-amplified DNA fragments comprising one of the 
v/p3 genes described herein (SEQ ID NO: 1, 3, 5, 7, 9, or 11, or mutants thereof), and 
PCR-amplified DNA fragments comprising at least one other of the vip3 genes 
described herein (or a mutant thereof) are recombined in vitro and resulting variants 
with improved insecticidal properties are recovered as described below. 

[00187] n order to increase the diversity of the shuffled vip3 gene library, a vip3 gene 
or genes (called the primary genes) are shuffled using synthetic oligonucleotide 
shuffling. A plurality (e.g., 2, 5, 10, 20, 50, 75, or 100 or more) of oligonucleotides 
corresponding to at least one region of diversity are synthesized. These 
oligonucleotides can be shuffled directly, or can be recombined with one or more of 
the family of nucleic acids. 

[00188] The oligonucleotide sequence can be taken from other vz/73 genes called 
secondary genes. The secondary genes have a certain degree of homology to the 
primary genes. There are several ways to select parts of the secondary gene for the 
oligonucleotide synthesis. For example, portions of the secondary gene can be selected 
at random. The DNA shuffling process will select those oligonucleotides, which can 
be incorporated into the shuffled genes. 

[00189] The selected portions can be any lengths as long as they are suitable to 

synthesize. The oligonucleotides can also be designed based on the homology between 
the primary and secondary genes. A certain degree of homology is necessary for 
crossover, which must occur among DNA fragments during the shuffling. At the same 
time, strong heterogeneity is desired for the diversity of the shuffled gene library. 
Furthermore, a specific portion of the secondary genes can be selected for the 
oligonucleotide synthesis based on the knowledge in the protein sequence and function 
relationship. 
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[00190] The present invention has disclosed that the C-tenninal domain of Vip3 is in 
part responsible for spectrum of acti^ty of the Vip3 toxins. When the insecticidal 
spectrum is modified by the current invention utilizing the DNA shuffling technology, 
the C-terminal region of the nucleotide sequence of the secondary genes can be 
selected as a target region for synthesizing oligonucleotides used in an oligonucleotide 
shuffling procedure. 

[00191] Since tiie insecticidal activity of the Vip3 protein is dependent, at least in part, 
to the N-terminal region, the N-terminal region of the secondary genes can be selected 
for oligonucleotide shuffling for increased insecticidal activity. 

[00192] In one aspect, the primary vip3C{a) and v7p3C(b) genes are shuffled with 
several oligonucleotides that are synthesized based on the secondary v^p3 A gene 
sequence. Fifp3C(a) and v/p3C(b) are highly homologous, but v/p3A is substantially 
different from these genes. Therefore, it is desirable to shuffle v?>3 A along with tiie 
v//73C(a) and v//?3C(b) to increase the diversity of resulting shuffled recombinant 
nucleic acids. Portions of the vip3 A sequence, which are substantially different from 
the corresponding portions of v/p3C(a) and wp3C(b), are selected, and a series of 50- 
mer oligonucleotides that cover these portions are synthesized. These oligonucleotides 
are shuffled with tiie v/p3C(a) and v//?3C(b). A certain number of the clones are tiien 
selected from the shuffled gene library and examined for the diversity by restriction 
mapping. The diversity is contemplated to be more than normally expected from the 
shuffling of wip3C(a) and v//;3C(b) alone. 

Example 11. High-throughput Screen for Insecticidal Activity. 

[00193] Shuffled vip3 gene libraries in either E. coli or Bacillus thuringiensis are 
screened for insecticidal activity. Colonies are picked with a Q-bot (Beckman), placed 
in growth media in a standard 96-well format and grown over night. Each clone is then 
layered onto the surface of an insect diet m 96-well format and the surface allowed to 
dry. Optionally, pools of transformed cells are added to each well to increase the 
number of clones that are tested in the initial screening round. For example, screening 
100 clones per well and using 10,000 wells provides a screen of 10^ clones. 
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[00194] Several neonate larvae of a target insect, for example, Heliothis virescens, 
Helicoverpa zea or Spodoptera firuffpenJa, are added to each well. The plate is 
covered with an air permeable membrane that retains the larvae in the wells into which 
they were placed. After 5 days the wells are evaluated for amount of diet consumed 
and/or insect mortality. Clones in wells indicating that little or no diet is consumed 
and/or where high insect mortality is observed are chosen for further analysis. Several 
clones should be found to have enhanced activity agamst the tai^get insect. 

Example 12: Cosmid Cloning a Full-length vipZC gene 

[00195] Total DNA was isolated from C1674 (NRRL B-30556) by treating freshly 
grown cells resuspended in 100 mM Tris pH 8, 10 mM EDTA with 2 mg/ml lysozyme 
for 30 minutes at Proteinase K was added to a final concentration of 100 jig/ml 
in 1% SDS, 50mM EDTA, IM urea and incubated at 55°C. An equal volume of 
phenol-chloroform-isoamyl alcohol was added. The sample was gently mixed for 5 
minutes and centrifuged at 3K. This was repeated twice. The aqueous phase was then 
mixed with 0.7 volumes isopropanol and centrifuged. The DNA pellet was washed 
three times with 70% ethanol and gently resuspended in 0.5X TE. 12 ng of DNA were 
treated with 0.3 unit oiSctuSA per jig of DNA at 37''C in a volume of 100 jil. Samples 
were taken at 2-min mt^als for 10 minutes. Then 1/10 volume lOX TE was added 
and samples were heated for 30 minutes at 65X to inactivate the enzyme. The samples 
were subjected to electrophoresis to determine which fraction is in the 40-kb range and 
this sample was used in the ligation. 

[00196] SuperCos cosmid vector (Stratagene, La Jolla, CA) was prepared as described 
by the supplier utilizing the BarnHI cloning site. Prepared SuperCos at 100 ng/ml was 
ligated with the C1674 DNA previously digested with Sau3A at a ratio of 2: 1 in a 5 |al 
volume overnight at 6^C. The ligation mbcture was packaged using Gigapack XL IQ 
(Stratagene) as described by the supplier. Packaged phages were infected into XL- 
IMR jE. coli cells (Stratagene) as described by the supplier. The cosmid library was 
plated on L-agar with SO ^g/ml kanamycin and incubated 16 hours at 37*^C. 200 
colonies were picked and grown for screening for the presence of the vip2C gene. 
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[00197] The 200 cosmid clones were screened for the presence of the vip3C gene by 

PGR using vip3C specific primers. 
[00198] Two cosmid clones were shown to comprise a vip3C coding sequence. After 

several sequencing runs the sequence was confirmed to be the sequence set forth in 

SEQ E) NO: 3 1. This vip3C coding sequence was designated vi/;3C-12168 and 

encodes the Vip3C-12168 protein (SEQ ID NO: 32). 

Example 13: Bioassay of Vip3C-12168. 

[00199] £. coli cells comprising an expression vector (pTrcHis; Invitrogen) comprising 
the v(p3C-12168 coding sequence were tested for biological activity using the 
protocol described in Example 6. The insect species tested were, European com 
borer (ECB), fall armyworm (FAW), black cutworm (BCW), tobacco 
budworm (TBW), and com earworm (CEW). Larval mortality, as well as the 
occurrence of feeding and growth inhibition, was recorded at 7 days after larval 
infestation of the diet. A sample containing E. coli cells with an empty 
expression vector (pTrcHis) acted as the negative control. E, coli cells 
expressing the STcndotoxin Cryl Ab and £1 coli cells expressing Vip3A protein 
were also tested in the same bioassay for comparison of spectrum of activity. 

[00200] Results are shown in Table 11 . The data show that Vip3C-12168 has the same 
spectrum of activity as a combination of Cryl Ab and Vip3 A. 



Table 11. 



Treatment 

CrylAb 


% Morta 


ity 


ECB 


FAW 


BCW 


TBW 


CEW 


100 


0 


10 


0^ 


8 


VIpSA 


0 


100 


100 


83^ 


100 


Vip3C-12168 


100 


100 


100 




100 


PTrcHis (empty vector) 


0 


0 


10 


0 


8 



•Growth inhibition; Teeding inhibition 
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Example 14: Maize Optimized Vip3C-12168 

[00201] A maize optimized V3p3C-12168 coding sequence was designed according to 
the procedure described in Example 5. The nucleotide sequence of the maize 
optimized V//73012168 coding sequence is shown in SEQ ID NO: 33. 

[00202] All publications and patent applications mentioned in this specification are 
indicative of the level of skill of those skilled in the art that this invention pertains. All 
publications and patent applications are herein incorporated by reference to the same 
extent as if each individual publication or patent appUcation was specifically and 
individually indicated to be incorporated by reference. 

[00203] It should be understood that the examples and embodiments described herein 
are for illustrative purposes only and that various modifications or changes in light 
thereof will be suggested to persons skilled in the art and are to be included within the 
spirit and purview of this application and the scope of the appended claims. 
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What is claimed is: 

1 . An isolated nucleic add molecule comprising a nucleotide sequence that encodes a 
toxin that is active against insects, wherem said nucleotide sequence: 

(a) has a compliment that hybridizes to nucleotides 198 1-2367 of SEQ ID NO: 
1 in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 
50**C. with washing in O.IXSSC, 0.1% SDS at 65X.; or 

(b) is isocoding with the nucleotide sequence of (a); or 

(c) has at least 93% sequence identity with SEQ ID NO: 1; or 

(d) encodes an amino acid sequence having at least 91% sequence identity with 
SEQ ID NO: 2. 

2. The isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence has a compliment that hybridizes to nucleotides 1981-2367 of SEQ ID NO: 1 in 
7% soduim dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 50*^0. with washing in 
O.IXSSC, 0.1%SDSat65°C. 

3. The isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence is isocoding with a nucleotide sequence having a compliment that hybridizes to 
nucleotides 1981-2367 of SEQ ID NO: 1 in 7% sodium dodecyl sulfate (SDS), 0.5 M 
NaP04, 1 mM EDTA at 50°C. with washing in O.IXSSC, 0.1% SDS at 65°C. 

4. The isolated nucleic acid molecule according to claim 1 comprising a nucleotide 
sequence that has at least 75% sequence identity with nucleotides 1981-2367 of SEQ ID 



5. The isolated nucleic acid molecule according to claim 1 comprising a nucleotide 
sequence that has at least 85% sequence identity with nucleotides 1981-2367 of SEQ ID 
NO: 1. 



NO: 1. 
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6. The isolated nucleic acid molecule according to cldm 1 comprising a nucleotide 
sequence that has at least 95% sequence identity with nucleotides 1981-2367 of SEQ ID 
NO: 1. 

7. The isolated nucleic acid molecule accordmg to claim 1 comprising a nucleotide 
sequence that has at least 99% sequence identity with nucleotides 1981-2367 of SEQ K> 
NO: 1. 

8. The isolated nucleic acid molecule according to claim 1 comprising nucleotides 
1981-2367 of SEQ ID NO: 1 or SEQ ID NO: 3. 

9. The isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence has at least 93% sequence identity with SEQ ID NO: L 

10. The isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence has at least 95% sequence identity with SEQ H) NO: 1 . 

1 1 . The isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence has at least 99% sequence identity with SEQ ID NO: 1. 

12. The isolated nucleic acid molecule according to claim 1 comprising the nucleotide 
sequence set forth in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 1 1, SEQ ID NO: 32 or 
SEQ ID NO: 34. 

13. The isolated nucleic acid molecule according to claim 1 comprising the nucleotide 
sequence of SEQ ID NO: 1, 

14. The isolated nucleic acid molecule according to claim 1 comprising the nucleotide 
sequence of SEQ ID NO: 3. 

15. The isolated nucleic acid molecule according to claim 1 comprising the nucleotide 
sequence of SEQ ID NO: 1 1. 
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16. The isolated nucleic acid molecule according to claim 1 comprising the nucleotide 
sequence of SEQ ID NO: 32. 

17. The isolated nucleic acid molecule according to claim 1 comprising the nucleotide 
sequence of SEQ ID NO: 34. 

18. The isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence encodes an amino acid sequence which has at least 91% identity to SEQ ID NO: 
2. 

19. The isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence encodes an amino acid sequence which has at least 95% identity to SEQ ID NO: 
2. 

20. The isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence encodes an amino acid sequence which has at least 99% identity to SEQ ID NO: 
2. 

21. The isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence encodes the amino acid sequence set forth in SEQ ID NO: 2, SEQ ID NO: 12, or 
SEQ ID NO: 33. 

22. The isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence encodes the amino acid sequence set fortii in SEQ ID NO: 2. 

23. The isolated nucleic acid molecule according to claim 1, wherein said nucleotide 
sequence encodes the ammo acid sequence set forth in SEQ ID NO: 12. 

24. The isolated nucleic add molecule according to claim 1, wherein said nucleotide 
sequence encodes the amino acid sequence set forth in SEQ ID NO: 33. 
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25. The isolated nucleic acid molecule according to claim 1 which is comprised in a 
Bacillus thuringiensis isolate selected from the group consisting of C1674, designated 
NRRL accession B-30556; and C536, designated NRRL accession B-30557. 

26. The isolated nucleic acid molecule according to claim 1 which comprises the 
approximately 2.4 kb DNA fragment comprised in an £ coli clone selected from the 
group consisting of pNOV3910, designated NRRL accession B-30553; pNOV391 1, 
designated NRRL accession B-30552; pNOV3906, designated NRRL accession B-30555; 
pNOV3905, designated NRRL accession B-30554; and pNOV3912, designated NRRL 
accession B-3055L 

27. The isolated nucleic acid molecule according to claim 1 which comprises the 
approximately 2.4 kb DNA fragment comprised in E. coli clone pNOV3910, designated 
NRRL accession B-30553. 

28. The isolated nucleic acid molecule according to claim 1 which comprises the 
approximately 2.4 kb DNA fragment comprised in E. coli clone pNOV3911, designated 
NRRL accession B-30552, 

29. The isolated nucleic acid molecule according to claim 1 which comprises the 
approximately 2.4 kb DNA fragment comprised in £. coli clone pNOV3906, designated 
NRRL accession B-30555. 

30. The isolated nucleic acid molecule according to claim 1 which comprises the 
approximately 2.4 kb DNA fragment comprised in E. coli clone pNOV3905, designated 
NRRL accession B-30554. 

3L The isolated nucleic acid molecule according to claim 1 which comprises the 
approximately 2.4 kb DNA fragment comprised in K coli clone pNOV3912, designated 
NRRL accession B-3055L 
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32. The isolated nucleic acid molecule according to claim 1, wherein said toxin 
comprises an amino acid sequence with at least 75% identity with amino acids 681-788 of 
the amino acid sequence of SEQ ID NO: 2. 

33. The isolated nucleic acid molecule according to claim 1, wherein said toxin 
comprises an amino acid sequence which has at least 85% identity with ammo acids 681- 
788 of the amino acid sequence of SEQ ID NO: 2. 

34. The isolated nucleic acid molecule according to claim 1, wherein said toxin 
comprises an amino acid sequence which has at least 95% identity with amino acids 681- 
788 of the amino acid sequence of SEQ ID NO: 2. 

35. The isolated nucleic acid molecule according to claim 1, wherein said toxin 
comprises an amino acid sequence which has at least 99% identity with amino acids 681- 
788 of the amino acid sequence of SEQ ID NO: 2. 

36. The isolated nucleic acid molecule according to claim 1, wherein said toxin 
comprises amino acids 681-788 of the amino acid sequence of SEQ ID NO: 2. 

37. The isolated nucleic acid molecule according to claim 1, wherein said toxin is 
active against a lepidopteran insect. 

38. The isolated nucleic acid molecule according to claim 37, wherein said 
lepidopteran insect is selected from the group consisting of: Ostrinia nubilalis (European 
com borer), Plutella xylostella (diamondback moth), Spodoptera frugiperda (fall 
armyworm), Agrotis ipsilon (black cutworm), Helicoverpa zea (com earworm), Heliothis 
virescem (tobacco budworm), Spodoptera exigita (beet armyworm), Pectinophora 
gossypiella (pink boll worm), Trichophisia ni (cabbage looper), Cochyles hospes (banded 
sunflower moth), and Homoeosoma electelhm (sunflower head moth). 

39. A chimeric gene comprising a heterologous promoter sequence operatively linked 
to the nucleic acid molecule of claim 1. 
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40. A recombinant vector comprising the chimeric gene of claim 39. 

41 . A transgenic host cell comprising the chimeric gene of cMm 39. 

42. The transgenic host cell according to claim 41, which is a bacterial cell. 

43 . The transgenic host cell according to claim 41, which is a plant cell. 

44. A transgenic plant comprising the transgenic plant cell of claim 43. 

45. The transgenic plant according to claim 44, wherein said plant is selected from the 
group consisting of sorghum, wheat, sunflower, tomato, cole crops, cotton, rice, soybean, 
sugar beet, sugarcane, tobacco, barley, oilseed rape and maize. ' ■ 

46. The transgenic plant according to claim 45, wherein said plant is a maize plant. 

47. Transgenic seed from the transgenic plant of claim 44. 

48. Transgenic seed from the maize plant of claim 46. 

49. An isolated toxin that is active against insects, wherein said toxin comprises an 
amino acid sequence that: 

a) has at least 75% identity with amino acids 661-788 of SEQ ID NO: 2; or 

b) has at least 91% identity with SEQ ID NO: 2; or 

c) is produced by the expression of a nucleic acid molecule comprising a 
nucleotide sequence that has a compliment that hybridizes to nucleotides 1981- 
2367 of SEQ ID NO: 1 in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04, 1 
mMEDTA at 50°C. with washing in O.IXSSC, 0.1% SDS at 65^C.; or 

d) is produced by the expression of a nucleic acid molecule comprising a 
nucleotide sequence that is isocoding with the nucleotide sequence of (c); or 
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e) is produced by the expression of a nucleic acid molecule comprising a 

nucleotide sequence that has at least 93% sequence identity with SEQ ID NO: 
1. 

50. The isolated toxin according to claim 49, wherein said toxin comprises an amino 
acid sequence which has at least 75% identity with amino acids 661-788 of the amino acid 
sequence of SEQ ID NO: 2. 

5 1 . The isolated toxm according to claim 49 comprising an amino acid sequence which 
has at least 85% identity with amino acids 661-788 of the amino acid sequence of SEQ ID 
NO: 2. . 

52. The isolated toxin according to claim 49 comprising an amino acid sequence which 
has at least 95% identity with amino acids 661-788 of the amino acid sequence of SEQ ID 
NO: 2. 

53. The isolated toxin accordmg to claim 49 comprising an amino acid sequence which 
has at least 99% identity with amino acids 661-788 of the amino acid sequence of SEQ ID 
NO: 2. 

54. The isolated toxin according to claim 49 comprising amino acids 661-788 of the 
amino acid sequences set forth in SEQ ID NO: 2. 

55. The isolated toxin according to claim 49 comprising an amino acid sequence which 
has at least 91% identity with the amino acid sequence set forth in SEQ ID NO: 2. 

56. The isolated toxin according to claim 49 comprising an amino acid sequence which 
has at least 95% identity with the amino acid sequence set forth in SEQ ID NO: 2. 

57. The isolated torn according to claim 49 comprising an amino acid sequence which 
has at least 97% identity with the amino acid sequence set forth in SEQ ID NO: 2. 
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58. The isolated toxin according to claim 49 comprising an amino acid sequence which 
has at least 99% identity with the amino acid sequence set forth in SEQ ID NO: 2. 

59. The isolated toxin according to claun 49 comprising the ammo acid sequence set 
forth in SEQ ID NO: 2, SEQ ID NO: 12, or SEQ ID NO: 33. 

60. The isolated toxin according to claun 59 comprising the amino acid sequence set 
forth in SEQ ID NO: 2. 

61. The isolated toxin according to claim 59 comprising the amino acid sequence set 
forth in SEQ ID NO: 12. 

62. The isolated toxin according to claim 59 comprising the amino acid sequence set 
forth in SEQ ID NO: 33. 

63. The isolated toxin according to claim 49, where said toxin is produced by the 
expression of a nucleic acid molecule comprising a nucleotide sequence that has a 
compliment that hybridizes to nucleotides 1981-2367 of SEQ ID NO: 1 in 7% sodium 
dodecyl sulfate (SDS), 0.5 M NaP04, 1 mM EDTA at 50T. with washing in 0. IXSSC, 
ai%SDSat65°C. 

64. The isolated toxin according to claim 49, where said toxin is produced by the 
expression of a nucleic acid molecule comprising a nucleotide sequence that is isocoding 
with a nucleotide sequence having compliment that hybridizes to nucleotides 1981-2367 
of SEQ ID NO: 1 in 7% sodium dodecyl sulfate (SDS), 0.5 MNaP04, 1 mM EDTA at 
50°C. with washing in 0. IXSSC, 0.1% SDS at 65T. 

65. The isolated toxin according to claim 49, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising a nucleotide sequence with at least 75% 
identity to nucleotides 1981-2367 of SEQ ID NO: 1, 
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66. The isolated toxin according to claim 49, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising a nucleotide sequence wth at least 85% 
identity to nucleotides 1981-2367 of SEQ IDNO: 1. 

67. The isolated toxin according to claim 49, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising a nucleotide sequence with at least 95% 
identity to nucleotides 1981-2367 of SEQ IDNO: 1. 

68. The isolated toxin according to claim 49, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising a nucleotide sequence with at least 99% 
identity to nucleotides 1981-2367 of SEQ ID NO: 1. 

69. The isolated toxin according to claim 49, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising nucleotides 1981-2367 of SEQ ID NO: 1 
or SEQ ID NO: 3. 

70. The isolated toxin according to claim 49, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising a nucleotide sequence with at least 93% 
sequence identity with SEQ ID NO: 1. 

71 . The isolated toxin according to claim 49, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising a nucleotide sequence with at least 95% 
sequence identity with SEQ ID NO: 1. 

72. The isolated toxin according to claim 49, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising a nucleotide sequence with at least 99% 
sequence identity with SEQ ID NO: 1. 

73. The isolated toxin according to claim 49, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising the nucleotide sequence set forth in SEQ 
ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 11, SEQ ID NO: 32, or SEQ ID NO: 34. 
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74. The isolated toTcin according to claim 73, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising SEQ ID NO: 1. 

75. The isolated toxin according to claim 73, wherein said toxin is produced by the 
e^qpression of a nucleic acid molecule comprising SEQ ID NO: 3. 

76. The isolated toxin according to claim 73, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising SEQ ID NO: 1 1. 

77. The isolated toxin according to claim 73, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising SEQ ID NO: 32. 

78. The isolated toxin according to claim 73, wherein said toxin is produced by the 
expression of a nucleic acid molecule comprising SEQ ID NO: 34. 

79. The isolated toxin according to claim 49, wherein said toxin has activity against a 
lepidopteran insect. 

80. The isolated toxin according to claim 79, wherein said lepidopteran insect is 
selected from the group consisting of Ostrinia mbilalis (European com borer), Pluiella 
o^lostella (diamondback moth), Spodoptera frugiperda (fall armyworm), Agrotis ipsilon 
(black cutworm), Helicoverpa zea (com earworm), Heliothis virescens (tobacco 
budworm), Spodoptera exigua (beet armyworm), Pectinophora gossypiella (pink boll 
worm), Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). 

81 . The isolated toxin according to claim 49, wherein said toxin is produced by a 
Bacillus thuringiensis strain selected from the group consisting of CI 674, designated 
NRKL accession B-30556; and C536, designated NKRL accession B-30557. 

82. The isolated toxin accordmg to claim 81, wherein said toxin is produced by 
Bacillus thuringiensis strain C1674, designated NRRL accession B-30556. 
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83. The isolated toxin according to claim 81, wherein said toxin is produced by 
Bacillus Umrin^ensis strain CS36, designated NRRL accession B-30SS7. 

84. The isolated toxin according to claim 49, wherein said toxin is produced by an E, 
coli clone selected from the group consisting of pNOV3910, designated as l^RRL 
accession B-30553; pNOV391 1, designated NRRL accession B-30552; pNOV3906, 
designated NRRL accession B-30555; pNOV3905, designated NRRL accession B-30554; 
and pNOV3912, designated NRRL accession B-3055L 

85. The isolated toxin according to claim 84, wherein said toxin is produced by the £. 
coli clone designated as NRRL accession B-30553. 

86. The isolated toxin according to claim 84, wherein said toxin is produced by the E. 
coli strain designated as NRRL accession B-30552. 

87. The isolated toxin according to claim 84, wherein said toxin is produced by the E, 
coli clone designated as NRRL accession B-30555. 

88. The isolated toxin according to claim 84, wherein said toxin is produced by the E, 
coli clone designated as NRRL accession B-30554. 

89. The isolated toxin according to claim 84, wherein said toxin is produced by the E, 
coli clone designated as NRRL accession B-3055L 

90. A composition comprising an efifective insect-controlling amount of the toxin 
accordmg to claim 49. 

9L A method of producing a toxin that is active against insects, comprising: 

(a) obtaining the transgenic host ceU according to claim 41; 

(b) culturing said transgenic host cell under conditions that permit 
production of the toxin; and 
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(c) recovering smd toxin. 



92. A method of producing an insect-resistant transgenic plant, comprising introducing 
the nucleic acid molecule according to claim 1 into a plant cell; and regenerating a 
transformed plant from said plant cell, wherein said transformed plant is insect resistant. 

93 . The method of claim 92, wherein said insects are lepidopteran insects. 

94. The method of clmm 93, wherein said lepidopteran insects are selected from the 
group consisting of: Ostrinia mbilalis (European com borer), Plutella xylostella 
(diamondback moth), Spodoptera frugiperda (fall armyworm), Agrotis ipsilon (black 
cutworm), Helicoverpa zea (com earworm), Heliothis virescens (tobacco budworm), 
Spodoptera exigua (beet armyworm), Pectinophora gossypiella (pink boll worm), 
Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). 

95. A method of controlling insects, comprising delivering to said insects an effective 
amount of the toxin according to claim 49. 

96. The method of claim 95, wherein said insects are lepidopteran insects. 

97. The method of claim 96, wherein said lepidopteran insects are selected from the 
group consisting of: Ostrinia rmbilalis (European com borer), Plutella xylostella 
(diamondback moth), Spodoptera frugiperda (fall armyworm), Agrotis ipsilon (black 
cutworm), Helicoverpa zea (com earworm), Heliothis virescens (tobacco budworm), 
Spodoptera exigua (beet armyworm), Pectinophora gossypiella (pink boll worm), 
Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (simflower head moth). 

98. The method of claim 95, wherein said toxin is delivered to the insects orally. 
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99. A method of protecting a maize plant against at least one insect pest, comprising: 
mtroducing the nucleic acid molecule according to claini 1 in a maize cell; and 
regenerating a stably transformed maize plant, wherein the transformed maize plant 
produces an msecticidal toxin m an amount sufBcient to protect the maize plant against at 
least one insect pest. 

100. A hybrid toxin active against insects, wherein said hybrid toxin is encoded by a 
nucleic acid molecule comprising a nucleotide sequence according to claim 1. 

101. The hybrid toxin according to claim 100, wherein said insects are lepidopteran 
insects. 

102. The hybrid toxin according to claim 101, wherein said lepidopteran insects are 
selected jfrom the group consisting of Ostrinia nubilalis (European com borer), Plutella 
xylostella (diamondback moth), Spodoptera frugiperda (fall armyworm), Agrotis ipsilon 
(black cutworm), Helicoverpa zea (com earworm), Heliothis virescem (tobacco 
budworm), Spodoptera exigua (beet armyworm), Pectinophora gossypiella (pink boll 
worai), Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). 

103. The hybrid toxin according to claim 100, wherein said hybrid toxin is encoded by 
the nucleotide sequence set forth in SEQ ID NO: 10. 

104. A composition comprising an insecticidally effective amount of the hybrid toxin 
according to claim 100. 

105. A method of producing a hybrid toxin active against insects, comprising: 

(a) obtaining the transgenic host cell according to claim 41; 

(b) culturing said transgenic host cell under conditions that permit 
production of the hybrid toxin; and 

(c) recovering said hybrid toxin. 
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106. A method of producing an insect-resistant transgenic plant, comprising introdudng 
the nucleic acid molecule according to claim 1 into a plant cell; and regenerating a 
transformed plant firom said plant cell, wherein said transformed plant produces a hybrid 
toxin and is insect resistant. 

107. The method of claim 106, wherem said insect is a lepidopteran insect. 

108. The method of claim 107, wherein said lepidopteran insect is selected from the 
group consisting of Ostrinia mhilalis (European com borer), Plntella xylostella 
(diamondback moth), Spodopiera frugiperda (fall armyworm), Agrotis ipsilon (black 
cutworm), Helicoverpa zea (com earworm), Heliothis virescens (tobacco budworm), 
Spodopiera exigua (beet armyworm), Pectinophora gossypiella (pink boll worm), 
Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). 

109. A method of controlling an insect comprising delivering to said insect an effective 
amount of the hybrid toxin according to claim 100. 

1 10. The method of claim 109, wherein said insect is a lepidopteran insect. 

111. The method of claim 1 10, wherein said lepidopteran insect is selected from the 
group consisting of Ostrinia mbilalis (European com borer), Pluiella xylostella 
(diamondback moth), Spodopiera frugiperda (fall armyworm), Agrotis ipsilon (black 
cutworm), Helicoverpa zea (com earworm), Heliothis virescens (tobacco budworm), 
Spodopiera exigua (beet armyworm), Pectinophora gossypiella (pink boll worm), 
Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). 

1 12. The method of claim 109, wherein said hybrid toxin is delivered to the insect 
orally. 
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113. A hybrid toxin active against insects, comprising a carboxy-teraiinal region of a 
Vip3 toxin joined in the amino to carboxy direction to an amino-terminal region of a 
dififerent Vip3 toxm, wherein said cart)0>jy-terminal region comprises an amino acid 
sequence which has at least 75% identity with amino acids 661-788 of SEQ ID NO: 2; and 
wherein said amino-terminal region has at least 85% identity with amino acids 1-660 of 
SEQ ID NO: 6. 

1 14. The hybrid toxin according to claim 1 13, wherem said caibojqr-terminal region 
comprises amino acids 661-788 of SEQ ID NO: 2, and wherein said amino-terminal 
region comprises amino acids 1-660 of SEQ ID NO: 6. 

115. The hybrid toxin according to claim 1 1 3, wherein said hybrid toxin comprises 
amino acids 1 -788 of SEQ ID NO: 12. 

1 16. A method of controlling an insect comprising delivering to said insect an effective 
amount of the hybrid toxin according to claim 113. 

117. The method of claim 1 16, wherein said insect is a lepidopteran insect. 

118. The method of claim 1 1 7, wherein said lepidopteran insect is selected from the 
group consisting of Ostrinia tmbilalis (European com borer), Plutella xylostella 
(diamondback moth), Spodopiera fnigiperda (fall armyworm), Agrotis ipsilon (black 
cutworm), Helicoverpa zea (com earworm), Heliothis virescens (tobacco budworm), 
Spodopiera exigua (beet armyworm), Pectinophora gossypiella (pink boll worm), 
Trichoplusia ni (cabbage looper), Cochyles hospes (banded sunflower moth), and 
Homoeosoma electellum (sunflower head moth). 

119. A nucleic acid molecule comprising a nucleotide sequence that encodes the hybrid 
toxin according to claim 113. 

120. An isolated vipZ nucleic acid molecule comprising a nucleotide sequence that: 

a) comprises SEQ ID NO: 8; or 
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b) encodes tiie amino acid sequence set forth in SEQ ID NO: 9. 



\ 
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SEQUENCE LISTING 

<110> syngenta Participations AG 
Shen, Zhi Cheng 
Warren, Gregory 
Shotkoski, Frank 
Kramer, Vance 

<120> Novel Vip3 Toxins and Methods of Use 

<130> 60163PCT 

<150> US 60/362250 
<151> 2002-03-06 

<160> 33 

<170> Patentin version 3.2 

<210> 1 

<211> 2367 

<212> DNA 

<213> Bacillus thuringiensis 



<220> 

<221> misc_feature 

<222> (1)..(2367) 

<223> Native vip3C coding sequence. 



j\n "r" at position 2213 represents the nucleotide g or a. 

atgaacaaga ataatactaa attaagcaca agagccctac cgagttttat tgattatttt 
aatggcattt atggatttgc cactggtatc aaagacatta tgaatatgat ttttaaaacg 
gatacaggtg gtaatctaac cttagacgaa atcctaaaga atcagcagtt actaaatgag 
atttctggta aattggatgg ggtaaatggg agcttaaatg atcttatcgc acagggaaac 
ttaaatacag aattatctaa ggaaatctta aaaatcgcaa atgaacagaa tcaagtctta 
aatgatgtta ataacaaact cgatgcgata aatacgatgc ttcatatata tctacctaaa 
attacatcta tgttaagtga tgtaatg^ag caaaattatg cgctaagtct gcaaatagaa 
tacttaagta agcaattgca agaaatttct gataaattag atattattaa cgtaaatgtt 
cttattaact ctacacttac tgaaattaca cctgcatatc aacggattaa atatgtgaat 
gaaaaatttg aagaattaac ttttgctaca gaaaccactt taaaagtaaa aaaggatagc 
tcgcctgctg atattcttga tgagtt^act gaattaactg aactagcgaa aagtgttaca 
aaaaatgacg ttgatggttt tgaattttac cttaatacat tccacgatgt aatggtagga 
aataatttat tcgggcgttc agctttaaaa actgcttcag aattaattgc taaagaaaat 
gtgaaaacaa gtggcagtga agtaggaaat gtttataatt tcttaattgt attaacagct 
ctacaagcaa aagcttttct tactttaaca acatgccgaa aattattagg cttagcaggt 
attgattata cttctattat gaatgaacat ttaaataagg aaaaagagga atttagagta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 

960 
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aacatccttc 

agtgatgaag 

gaaatgagca 

tatcaagttg 

tgtccagatc 

gtaattacta 

aatttttatg 

gaagcggagt 

atcagtgaaa 

agattaatta 

agcaataaag 

aacgggtcca 

gtagatcata 

ttttcacaat 

gttaaaggaa 

gatacaaata 

gatttaaagg 

aaatttacaa 

ccgaattctt 

ttggggacaa 

ataagcttta 

tttgaacgaa 

tccaataata 

tcatttgaaa 



ctacactttc 

atgcaaagat 

atgattcaat 

ataaggattc 

aatctgaaca 

aaattgattt 

attcttctac 

atagaacgtt 

catttttgac 

ctttaacatg 

aaactaaatt 

tagaagagga 

caggcggagt 

ttattggaga 

aaccttctat 

ataatttaaa 

gagtgtattt 

ttttagaaat 

ggattacgac 

atgggacctt 

ctgcatcagg 

gcaaccttat 

ccggattata 

acgtttctat 



taatactttt 

gattgtggaa 

cacagtatta 

cctatcggag 

aatatattat 

cactaaaaaa 

aggagaaatt 

aagtgctaat 

tccgataaat 

taaatcatat 

gatcgtccca 

caatttagag 

gaatggaact 

taagttaaaa 

tcatttaaaa 

agattatcaa 

aattttaaaa 

taagcctgcg 

tccaggggct 

tagacaaagt 

accatttaat 

gtcttcaact 

tgtagaactt 

taaataa 



tctaatccta 

gctaaaccag 

aaagtatatg 

gttatttatg 

acaaataaca 

atgaaaactt 

gacttaaata 

gatgatggag 

gggtttggcc 

ttaagagaac 

ccaagtggtt 

ccgtggaaag 

aaagctttat 

ccgaaaactg 

gatgaaaata 

actattacta 

agtcaaaatg 

gaggatttat 

agcatttcag 

ctttcattaa 

gtgacggtaa 

agtcatattt 

tcccgtcgct 



attatgcaaa 

gacatgcatt 

aggctaagct 

gtgatacgga 

tagtattccc 

taagatatga 

agaaaaaagt 

tgtatatgcc 

tccaagctga 

tactgctagc 

ttattagcaa 

caaataataa 

atgttcataa 

agtatgtaat 

ctggatatat 

aacgttttac 

gagatgaagc 

taagcccaga 

gaaataaact 

acagttattc 

gaaattctag 

ctgggacatt 

ctggtggtgg 



agttaaagga 

ggttgggttt 

aaaacaaaat 

taaattattt 

aaatgaatat 

ggtaacagcg 

agaatcaagt 

attaggtgtc 

tgaaaattca 

aacagactta 

tattgtagag 

gaatgcgtat 

ggacggagga 

ccaatatact 

tcattatgaa 

tacaggaact 

ttggggagat 

attaattaat 

tttcattaac 

aacttatagt 

ggragtatta 

caaaactgaa 

tggtcatata 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2367 



<210> 2 

<211> 788 

<212> PRT 

<213> Bacilus thuringiensis 



<220> 

<221> MISC_FEATURE 

<222> (1)..{788) 

<223> VipSC Toxin 

The Xaa at position 738 is either the amino acid Glu or Gly. 

<400> 2 

Met Asn Lys Asn Asn Thr Lys Leu Ser Thr Arg Ala Leu Pro Ser Phe 

2 



wo 03/075655 PCTAJS03/04735 
15 10 15 

He Asp Tyr Phe Asn Gly He Tyr Gly Phe Ala Thr Gly He Lys Asp 
20 25 30 

He Met Asn Met He Phe Lys Thr Asp Thr Gly Gly Asn Leu Thr Leu 
'35 40 45 

Asp Glu He Leu Lys Asn Gin Gin Leu Leu Asn Glu He Ser Gly Lys 
50 S5 60 

Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu He Ala Gin Gly Asn 
65 70 75 80 

Leu Asn Thr Glu Leu Ser Lys Glu He Leu Lys He Ala Asn Glu Gin 
85 90 95 

Asn Gin Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala He Asn Thr 
100 105 . 110 

Met Leu His He Tyr Leu Pro Lys He Thr Ser Met Leu Ser Asp Val 
115 120 125 

Met Lys Gin Asn Tyr Ala Leu Ser Leu Gin He Glu Tyr Leu Ser Lys 
130 135 140 

Gin Leu Gin Glu He Ser Asp Lys Leu Asp He He Asn Val Asn Val 
145 150 155 160 

Leu He Asn Ser Thr Leu Thr Glu He Thr Pro Ala Tyr Gin Arg He 
165 170 175 

Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr Glu Thr 
180 185 190 

Thr Leu Lys Val Lys Lys Asp Ser Ser Pro Ala Asp He Leu Asp Glu 
195- 200 205 

Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val 
210 215 220 

Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met Val Gly 
225 230 235 240 

Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu He 
245 250 255 



Ala Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn Val Tyr 
260 265 270 
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T^n Phe Leu lie Val Leu Thr Ala Leu Gin Ala Lys Ala Phe Leu Thr 
275 280 285 



Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala Gly lie Asp Tyr Thr 
290 295 300 



Ser lie Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Arg Val 
305 310 315 320 



Asn lie Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala 
325 330 335 



Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met lie Val Glu Ala Lys 
340 345 350 



Pro Gly His Ala Leu Val Gly Phe Glu Met Ser Asn Asp Ser lie Thr 
355 360 365 



Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gin Asn Tyr Gin Val Asp 
370 375 380 



Lys Asp Ser Leu Ser Glu Val lie Tyr Gly Asp Thr Asp Lys Leu Phe 
385 390 395 400 



Cys Pro Asp Gin Ser Glu Gin He Tyr Tyr Thr Asn Asn He Val Phe 
405 410 415 



Pro Asn Glu Tyr Val He Thr Lys He Asp Phe Thr Lys Lys Met Lys 
420 425 430 



Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly 
435 440 445 



Glu He Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala Glu Tyr 
450 455 460 



Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val 
465 470 475 480 



He Ser Glu Thr Phe Leu Thr Pro He Asn Gly Phe Gly Leu Gin Ala 
485 490 495 



Asp Glu Asn Ser Arg Leu He Thr Leu Thr Cys Lys Ser Tyr Leu Arg 
500 505 510 



Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu He 
515 520 525 
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Val Pro Pro Ser Gly Phe lie Ser Asn lie Val Glu Asn Gly Ser He 
530 535 540 



Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala Tyr 
545 550 555 560 



Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr Val His 
565 570 575 



Lys Asp Gly Gly Phe Ser Gin Phe He Gly Asp Lys Leu Lys Pro Lys 
580 585 590 



Thr Glu Tyr Val He Gin Tyr Thr Val Lys Gly Lys Pro Ser He His 
595 600 605 



Leu Lys Asp Glu Asn Thr Gly Tyr He His Tyr Glu Asp Thr Asn Asn 
610 615 620 



Asn Leu Lys Asp Tyr Gin Thr He Thr Lys Arg Phe Thr Thr Gly Thr 
625 630 635 640 



Asp Leu Lys Gly Val Tyr Leu He Leu Lys Ser Gin Asn Gly Asp Glu 
645 650 655 



Ala Trp Gly Asp Lys Phe Thr He Leu Glu He Lys Pro Ala Glu Asp 
660 665 670 



Leu Leu Ser Pro Glu Leu He Asn Pro Asn Ser Trp He Thr Thr Pro 
675 680 685 



Gly Ala Ser He Ser Gly Asn Lys Leu Phe He Asn Leu Gly Thr Asn 
690 695 700 



Gly Thr Phe Arg Gin Ser Leu Ser Leu Asn Ser Tyr Ser Thr Tyr Ser 
705 710 715 720 



He Ser Phe Thr Ala Ser Gly Pro Phe Asn Val Thr Val Arg Asn Ser 
725 730 735 



Arg Xaa Val Leu Phe Glu Arg Ser Asn Leu Met Ser Ser Thr Ser His 
740 745 750 



He Ser Gly Thr Phe Lys Thr Glu Ser Asn Asn Thr Gly Leu Tyr Val 
755 760 765 



Glu Leu Ser Arg Arg Ser Gly Gly Gly Gly His He Ser Phe Glu Asn 
770 775 780 
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Val Ser lie Lys 
785 

<210> 3 
<211> 2367 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Maize optimized vipSC coding sequence. 

An "r" at positions 2213 and 2214 represents the nucleotide g or 
a. 

<400> 3 

atgaacaaga acaacaccaa gctctccacc cgcgccctcc cgtccttcat cgactacttc 60 

aacggcatct acggcttcgc caccggcatc aaggacatca tgaacatgat cttcaagacc 120 

gacaccggcg gcaacctcac cctcgacgag atcctcaaga accagcagct cctcaacgag 180 

atcagcggca agctcgacgg cgtgaacggc tccctcaacg acctcatcgc ccagggcaac 240 

ctcaacaccg agctgtccaa ggagatcctc aagatcgcca acgagcagaa ccaggtgctc 300 

aacgacgtga acaacaagct cgacgccatc aacaccatgc tccacatcta cctcccgaag 360 

atcacctcca tgctctccga cgtgatgaag cagaactacg ccctctccct ccagatcgag 420 

tacctctcca agcagctcca ggagatcagc gacaagctcg acatcatcaa cgtgaacgtg 480 

ctcatcaact ccaccctcac cgagatcacc ccggcctacc agcgcatcaa gtacgtgaac 540 

gagaagttcg aggagctgac cttcgccacc gagaccaccc tcaaggtgaa gaaggactcc 600 

tccccggccg acatcctcga cgagctgacc gagctgaccg agctggccaa gtccgtgacc 660 

aagaacgacg tggacggctt cgagttctac ctcaacacct tccacgacgt gatggtgggc 720 

aacaacctct tcggccgctc cgccctcaag accgcctccg agctgatcgc caaggagaac 780 

gtgaagacct ccggctccga ggtgggcaac gtgtacaact tcctcatcgt gctcaccgcc 840 

ctgcaggcca aggccttcct caccctcacc acctgccgca agctcctcgg cctcgccggc 900 

atcgactaca cctccatcat gaacgagcac ctcaacaagg agaaggaggd gttccgcgtg 960 

aacatcctcc cgaccctctc caacaccttc tccaacccga actacgccaa ggtgaagggc 1020 

tccgacgagg acgccaagat gatcgtggag gccaagccgg gccacgccct cgtgggcttc 1080 

gagatgtcca acgactccat caccgtgctc aaggtgtacg aggccaagct caagcagaac 1140 

taccaggtgg acaaggactc cctctccgag gtgatctacg gcgacaccga caagctcttc 1200 

tgcccggacc agtccgagca gatatactac accaacaaca tcgtgttccc gaacgagtac 1260 

gtgatcacca agatcgactt caccaagaag atgaagaccc tccgctacga ggtgaccgcc 1320 

aacttctacg actcctccac cggcgagatc gacctcaaca agaagaaggt ggagtcctcc 1380 

gaggccgagt accgcaccct ctccgccaac gacgacggcg tgtacatgcc gctcggcgtg 1440 

atctccgaaa ccttcctcac cccgatcaac ggcttcggcc tccaggccga cgagaactcc 1500 
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cgcctcatca ccctcacctg caagtcctac ctccgcgagc tgctcctcgc caccgacctc 1560 

tccaacaagg agaccaagct catcgtgccg ccgtccggct tcatctccaa catcgtggag 1620 

aacggctcca tcgaggagga caacctcgag ccgtggaagg ccaacaacaa gaacgcctac 1680 

gtggaccaca ccggcggcgt gaacggcacc aaggccctct acgtgcacaa ggacggcggc 1740 

ttctcccagt tcatcggcga caagctcaag ccgaagaccg agtacgtgat ccagtacacc 1800 

gtgaagggca agccgtccat ccacctcaag gacgagaaca ccggctacat ccactacgag 1860 

gacaccaaca acaacctcaa ggactaccag accatcacca agcgcttcac caccggcacc 1920 

gacctcaagg gcgtgtacct catcctcaag tcccagaacg gcgacgaggc ctggggcgac 1980 

aagttcacca tccttgagat caagccggcc gaggacctcc tctccccgga gctgatcaac 2040 

ccgaactcct ggatcaccac cccgggcgcc tccatctccg gcaacaagct cttcatcaac 2100 

ctcggcacca acggcacctt ccgccagtcc ctctccctca actcctactc cacctactcc 2160 

atctccttca ccgcctccgg cccgttcaac gtgaccgtgc gcaactcccg cgrrgtgctc 2220 

ttcgagcgct ccaacctcat gtcctccacc tcccacatct ccggcacctt caagaccgag 2280 

tccaacaaca ccggcctcta cgtggagctg tcccgccgct ccggcggcgg cggccacatc 2340 

tccttcgaga acgtgtccat caagtag 2367 

<210> 4 
<211> 2370 
<212> DNA 

<213> Bacillus thuringiensis 
<220> 

<221> misc^feature 
<222> (1)..{2370) 

<223> vip3A(a) native coding sequence. 
<400> 4 

atgaacaaga ataatactaa attaagcaca agagccttac caagttttat tgattatttt 60 

aatggcattt atggatttgc cactggtatc aaagacatta tgaacatgat ttttaaaacg 120 

gatacaggtg gtgatctaac cctagacgaa attttaaaga atcagcagtt actaaatgat 180 

atttctggta aattggatgg ggtgaatgga agcttaaatg atcttatcgc acagggaaac 240 

ttaaatacag aattatctaa ggaaatatta aaaattgcaa atgaacaaaa tcaagtttta 300 

aatgatgtta ataacaaact cgatgcgata aatacgatgc ttcgggtata tctacctaaa 360 

attacctcta tgttgagtga tgtaatgaaa caaaattatg cgctaagtct gcaaatagaa 420 

tacttaagta aacaattgca agagatttct gataagttgg atattattaa tgtaaatgta 480 

cttattaact ctacacttac tgaaattaca cctgcgtatc aaaggattaa atatgtgaac 540 

gaaaaatttg aggaattaac ttttgctaca gaaactagtt caaaagtaaa aaaggatggc 600 

tctcctgcag atattcttga tgagttaact gagttaactg aactagcgaa aagtgtaaca 660 
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aaaaatgatg tggatggttt tgaattttac cttaatacat tccacgatgt aatggtagga 720 

aataatttat tcgggcgttc agctttaaaa actgcatcgg aattaattac taaagaaaat 780 

gtgaaaacaa gtggcagtga ggtcggaaat gtttataact tcttaattgt attaacagct 840 

ctgcaagcaa aagcttttct tactttaaca acatgccgaa aattattagg cttagcagat 900 

attgattata cttctattat gaatgaacat ttaaataagg aaaaagagga atttagagta 960 

aacatcctcc ctacactttc taatactttt tctaatccta attatgcaaa agttaaagga 1020 

agtgatgaag atgcaaagat gattgtggaa gctaaaccag gacatgcatt gattgggttt 1080 

gaaattagta atgattcaat tacagtatta aaagtatatg aggctaagct aaaacaaaat 1140 

tatcaagtcg ataaggattc cttatcggaa gttatttatg gtgatatgga taaattattg 1200 

tgcccagatc aatctgaaca aatctattat acaaataaca tagtatttcc aaatgaatat 1260 

gtaattacta aaattgattt cactaaaaaa atgaaaactt taagatatga ggtaacagcg 1320 

aatttttatg attcttctac aggagaaatt gacttaaata agaaaaaagt agaatcaagt 1380 

gaagcggagt atagaacgtt aagtgctaat gatgatgggg tgtatatgcc gttaggtgtc 1440 

atcagtgaaa catttttgac tccgattaat gggtttggcc tccaagctga tgaaaattca 1500 

agattaatta ctttaacatg taaatcatat ttaagagaac tactgctagc aacagactta 1560 

agcaataaag aaactaaatt gatcgtcccg ccaagtggtt ttattagcaa tattgtagag 1620 

aacgggtcca tagaagagga caatttagag ccgtggaaag caaataataa gaatgcgtat 1680 

gtagatcata caggcggagt gaatggaact aaagctttat atgttcataa ggacggagga 1740 

atttcacaat ttattggaga taagttaaaa ccgaaaactg agtatgtaat ccaatatact 1800 

gttaaaggaa aaccttctat tcatttaaaa gatgaaaata ctggatatat tcattatgaa 1860 

gatacaaata ataatttaga agattatcaa actattaata aacgttttac tacaggaact 1920 

gatttaaagg gagtgtattt aattttaaaa agtcaaaatg gagatgaagc ttggggagat 1980 

aactttatta ttttggaaat tagtccttct gaaaagttat taagtccaga attaattaat 2040 

acaaataatt ggacgagtac gggatcaact aatattagcg gtaatacact cactctttat 2100 

cagggaggac gagggattct aaaacaaaac cttcaattag atagtttttc aacttataga 2160 

gtgtattttt ctgtgtccgg agatgctaat gtaaggatta gaaattctag ggaagtgtta 2220 

tttgaaaaaa gatatatgag cggtgctaaa gatgtttctg aaatgttcac tacaaaattt 2280 

gagaaagata acttttatat agagctttct caagggaata atttatatgg tggtcctatt 2340 
gtacattttt acgatgtctc tattaagtaa 

<210> 5 
<211> 789 
<212> PRT 

<213> Bacillus thuringiensis 



2370 
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<220> 

<221> MISC_FEATURE 

<222> (1)..(789) 

<223> VipSA toxin 

<400> 5 

Met Asn Lys Asn Asn Thr Lys Leu Ser Thr Arg 7U.a Leu Pro Ser Phe 
15 10 15 



He Asp Tyr Phe Asn Gly He Tyr Gly Phe Ala Thr Gly He Lys Asp 
20 25 30 



He Met Asn Met He Phe Lys Thr Asp Thr Gly Gly Asp Leu Thr Leu 
35 40 45 



Asp Glu He Leu Lys Asn Gin Gin Leu Leu Asn Asp He Ser Gly Lys 
50 55 60 



Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu . He Ala Gin Gly Asn 
65 70 75 80 

Leu Asn Thr Glu Leu Ser Lys Glu He Leu Lys He Ala Asn Glu Gin 
85 90 95 



Asn Gin Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala He Asn Thr 
100 105 110 

Met Leu Arg Val Tyr Leu Pro Lys He Thr Ser Met Leu Ser Asp Val 
115 120 125 



Met Lys Gin Asn Tyr Ala Leu Ser Leu Gin He Glu Tyr Leu Ser Lys 
130 135 140 



Gin Leu Gin Glu He Ser Asp Lys Leu Asp He He Asn Val Asn Val 
145 150 155 160 



Leu He Asn Ser Thr Leu Thr Glu He Thr Pro Ala Tyr Gin Arg He 
165 170 175 



Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr Glu Thr 
180 185 190 



Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala Asp He Leu Asp Glu 
195 200 205 



Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val 
210 215 220 



Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met Val Gly 
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225 230 235 240 

Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu He 
245 250 255 

Thr Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn Val Tyr 
260 265 270 

Asn Phe Leu He Val Leu Thr Ala Leu Gin Ala Lys Ala Phe Leu Thr 
275 280 285 

Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala Asp lie Asp Tyr Thr 
290 295 300 

Ser He Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Arg Val 
305 310 315 320 

Asn He Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala 
325 330 335 

Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met He Val Glu Ala Lys 
340 345 350 

Pro Gly His Ala Leu He Gly Phe Glu He Ser Asn Asp Ser He Thr 
355 360 365 

Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gin Asn Tyr Gin Val Asp 
370 375 380 

Lys Asp Ser Leu Ser Glu Val He Tyr Gly Asp Met Asp Lys Leu Leu 
385 390 395 400 

Cys Pro Asp Gin Ser Glu Gin He Tyr Tyr Thr Asn Asn He Val Phe 
405 410 415 

Pro Asn Glu Tyr Val He Thr Lys He Asp Phe Thr Lys Lys Met Lys 
420 425 430 

Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly 
435 440 445 

Glu He Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala Glu Tyr 
450 455 460 

Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val 
465 470 475 480 



He Ser Glu Thr Phe Leu Thr Pro He Asn Gly Phe Gly Leu Gin Ala 
485 490 495 
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Asp Glu Asn Ser Arg Leu lie Thr Leu Thr Cys Lys Ser Tyr Leu Arg 
500 505 510 



Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu lie 
515 520 525 



Val Pro Pro Ser Gly Phe He Ser Asn He Val Glu Asn Gly Ser He 
530 535 540 



Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala Tyr 
545 550 555 560 



Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr Val His 
565 570 575 



Lys Asp Gly Gly He Ser Gin Phe He Gly Asp Lys Leu Lys Pro Lys 
580 585 590 



Thr Glu Tyr Val He Gin Tyr Thr Val Lys Gly Lys Pro Ser He His 
595 600 605 



Leu Lys Asp Glu Asn Thr Gly Tyr He His Tyr Glu Asp Thr Asn Asn 
610 615 620 



Asn Leu Glu Asp Tyr Gin Thr He Asn Lys Arg Phe Thr Thr Gly Thr 
625 630 635 640 



Asp Leu Lys Gly Val Tyr Leu He Leu Lys Ser Gin Asn Gly Asp Glu 
645 650 655 



Ala Trp Gly Asp Asn Phe He He Leu Glu He Ser Pro Ser Glu Lys 
660 665 670 



Leu Leu Ser Pro Glu Leu He Asn Thr Asn Asn Trp Thr Ser Thr Gly 
675 680 685 



Ser Thr Asn He Ser Gly Asn Thr Leu Thr Leu Tyr Gin Gly Gly Arg 



Gly He Leu Lys Gin Asn Leu Gin Leu Asp Ser Phe Ser Thr Tyr Arg 
705 710 715 720 



Val Tyr Phe Ser Val Ser Gly Asp Ala Asn Val Arg He Arg Asn Ser 
725 730 735 



Arg Glu Val Leu Phe Glu Lys Arg Tyr Met Ser Gly Ala Lys Asp Val 
740 745 750 
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Ser Glu Met Phe Thr Thr Lys Phe Glu Lys Asp J^n Phe Tyr lie Glu 
755 760 765 



Leu Ser Gin Gly Asn Asn Leu Tyr Gly Gly Pro He Val His Phe Tyr 
770 775 780 



Asp Val Ser He Lys 
785 



<210> 6 
<211> 2364 
<212> DMA 

<213> Bacillus thuringiensis 
<220> 

<221> misc_feature 
<222> (1)..(2364) 

<223> vip3B native coding sequence. 
<400> 6 

atgaacaaga ataatactaa attaaacgca agggccttac cgagttttat tgattatttt 60 
aatggcattt atggatttgc cactggtatc aaagacatta tgaacatgat ttttaaaacg 120 
gatacaggtg gaaatctaac cctagacgaa attttaaaaa atcagcagtt attaaatgag 180 
atttctggta aattggatgg ggtaaatggg agcttaaacg atcttatcgc acagggaaac 240 
ttaaatacag aattatctaa ggaaatctta aaaattgcaa atgagcagaa tcaagtctta 
aatgatgtta ataacaaact taatgcgata aatacaatgc ttcacatata tctacctaaa 
attacatcta tgttaaatga tgtaatgaaa caaaattatg cactaagtct gcaaatagaa 
tacctaagta aacaattgca agaaatttcc gacaagttag atgtcattaa cgtgaatgta 480 
cttattaact ctacacttac tgaaattaca cctgcgtatc aacggatgaa atatgtaaat 540 
gaaaaatttg aagatttaac ttttgctaca gaaaccactt taaaagtaaa aaagaatagc 600 
tcccctgcag atattcttga tgagttaact gagttaactg aactagcgaa aagtgtaaca 660 
aaaaatgacg tggatggttt tgaattttac cttaatacat tccacgatgt aatggtagga 720 
aacaatttat tcgggcgttc agctttaaaa actgcttcgg aattaatcgc taaagaaaat 780 
gtgaaaacaa gtggcagtga ggtaggaaat gtttataatt tcttaattgt attaacagct 840 
ctgcaagcaa aagcttttct tactttaaca acatgccgga aattattagg cttagcagat 



300 
360 
420 



900 



attgattata ctttcattat gaatgaacat ttagataagg aaaaagagga atttagagta 960 

aatatccttc ctacactttc taatactttt tctaatccta actatgcaaa agctaaagga 1020 

agcaatgaag atgcaaagat aattgtggaa gctaaaccag gatatgcttt ggttggattt 1080 

gaaatgagca atgattcaat cacagtatta aaagcatatc aggctaagct aaaacaagat 1140 

tatcaagttg ataaagattc gttatcagaa attgtctatg gtgatatgga taaattattg 1200 
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tgcccggatc aatctgaaca aatatattat acaaataaca ttgcttttcc caatgaatat 1260 

gtaattacta aaattacttt tactaaaaaa atgaatagtt taagatatga ggcaacagct 1320 

aatttttatg attcttctac aggggatatt gatctaaata agacaaaagt agaatcaagt 1380 

gaagcagagt atagtacgct aagtgctagt actgatggag tctatatgcc gttaggtatt 1440 

atcagtgaaa catttttgac tccaattaat gggtttggaa tcgtagtcga tgaaaattca 1500 

aaattagtaa atttaacatg taaatcatat ttaagagagg tattattagc aacagactta 1560 

agtaataaag aaactaaatt gattgtccca cctattggtt ttattagcaa tattgtagaa 1620 

aatgggaact tagagggaga aaacttagag ccgtggaaag caaataacaa aaatgcgtat 1680 

gtagatcata caggcggcgt aaatggaact aaagctttat atgttcataa ggatggtgag 1740 

ttttcacaat ttattggaga taagttgaaa tcgaaaacag aatatgtaat tcaatatatt 1800 

gtaaagggaa aagcttctat tcttttgaaa gatgaaaaaa atggtgattg catttatgaa 1860 

gatacaaata atggtttaga agattttcaa accattacta aaagttttat tacaggaacg 1920 

gattcttcag gagttcattt aatatttaat agtcaaaatg gcgatgaagc atttggggaa 1980 

aactttacta tttcagaaat taggctttcc gaagatttat taagtccaga attgataaat 2040 

tcagatgctt gggttggatc tcagggaact tggatctcag gaaattcact cactattaat 2100 

agtaatgtga atggaacttt tcgacaaaac ctttcgttag aaagctattc aacttatagt 2160 

atgaacttta atgtgaatgg atttgccaag gtgacagtaa gaaattcccg tgaagtatta 2220 

tttgaaaaaa attatccgca gctttcacct aaagatattt ctgaaaaatt cacaactgca 2280 

gccaataata ccgggttgta tgtagagctt tctcgtttta catcgggtgg cgctataaat 2340 

ttccggaatt tttcgattaa gtga 2364 

<210> 7 

<211> 787 

<212> PRT 

<213> Bacillus thuringiensis 
<220> 

<221> MISC_FEATURE 

<222> (1)..(787) 

<223> Vip3B Toxin 

<400> 7 

Met Asn Lys Asn Asn Thr Lys Leu Asn Ala Arg Ala Leu Pro Set Phe 
15 10 15 

He Asp Tyr Phe Asn Gly He Tyr Gly Phe Ala Thr Gly He Lys Asp 
20 25 30 

He Met Asn Met He Phe Lys Thr Asp Thr Gly Gly Asn Leu Thr Leu 



35 



40 



45 
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Asp Glu lie Leu Lys Asn Gin Gin Leu Leu Asn Glu He Ser Gly Lys 
50 55 60 

Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu He Ala Gin Gly Asn 
65 70 75 80 

Leu Asn Thr Glu Leu Ser Lys Glu He Leu Lys He Ala Asn Glu Gin 
85 90 95 

Asn Gin Val Leu Asn Asp Val Asn Asn Lys Leu Asn Ala He Asn Thr 
100 105 110 

Met Leu His He Tyr Leu Pro Lys He Thr Ser Met Leu Asn Asp Val 
115 120 125 

Met Lys Gin Asn Tyr Ala Leu Ser Leu Gin He Glu Tyr Leu Ser Lys 
130 135 140 

Gin Leu Gin Glu He Ser Asp Lys Leu Asp Val He Asn Val Asn Val 
145 150 155 160 

hen He Asn Ser Thr Leu Thr Glu He Thr Pro Ala Tyr Gin Arg Met 
165 170 175 

Lys Tyr Val Asn Glu Lys Phe Glu Asp Leu Thr Phe Ala Thr Glu Thr 
180 185 190 

Thr Leu Lys Val Lys Lys Asn Ser Ser Pro Ala Asp He Leu Asp Glu 
195 200 205 

Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val 
210 215 220 

Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met Val Gly 
225 230 235 240 

Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu He 
245 250 255 

Ala Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn Val Tyr 
260 265 270 

TVsn Phe Leu He Val Leu Thr Ala Leu Gin Ala Lys Ala Phe Leu Thr 
275 280 285 

Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala Asp He Asp Tyr Thr 
290 295 300 
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Phe He Met Asn Glu His Leu Asp Lys Glu Lys Glu Glu Phe Arg Val 
305 310 315 320 

Asn He Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala 
325 330 335 

Lys Ala Lys Gly Ser Asn Glu Asp Ala Lys He He Val Glu Ala Lys 
340 345 350 

Pro Gly Tyr Ala Leu Val Gly Phe Glu Met Ser Asn Asp Ser He Thr 
355 360 365 

Val Leu Lys Ala Tyr Gin Ala Lys Leu Lys Gin Asp Tyr Gin Val Asp 
370 375 380 

Lys Asp Ser Leu Ser Glu He Val Tyr Gly Asp Met Asp Lys Leu Leu 
385 390 395 400 

Cys Pro Asp Gin Ser Glu Gin He Tyr Tyr Thr Asn Asn He Ala Phe 
405 410 415 

Pro Asn Glu Tyr Val He Thr Lys He Thr Phe Thr Lys Lys Met Asn 
420 425 430 

Ser Leu Arg Tyr Glu Ala Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly 
435 440 445 

Asp He Asp Leu Asn Lys Thr Lys Val Glu Ser Ser Glu Ala Glu Tyr 
450 455 460 

Ser Thr Leu Ser Ala Ser Thr Asp Gly Val Tyr Met Pro Leu Gly He 
465 470 475 480 

He Ser Glu Thr Phe Leu Thr Pro He Asn Gly Phe Gly He Val Val 
485 490 495 

Asp Glu Asn Ser Lys Leu Val Asn Leu Thr Cys Lys Ser Tyr Leu Arg 
500 505 510 

Glu Val Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu He 
515 520 525 

val Pro Pro He Gly Phe He Ser Asn He Val Glu Asn Gly Asn Leu 
530 535 540 

Glu Gly Glu Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala Tyr 
545 550 555 560 



Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr Val His 
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565 570 575 



Lys Asp Gly Glu Phe Ser Gin Phe lie Gly Asp Lys Leu Lys Ser Lys 
580 585 590 



Thr Glu Tyr Val lie Gin Tyr He Val Lys Gly Lys Ala Ser He Leu 
595 600 605 



Leu Lys Asp Glu Lys Asn Gly Asp Cys He Tyr Glu Asp Thr Asn Asn 
610 615 620 



Gly Leu Glu Asp Phe Gin Thr He Thr Lys Ser Phe He Thr Gly Thr 
625 630 635 640 



Asp Ser Ser Gly Val His Leu He Phe Asn Ser Gin Asn Gly Asp Glu 
645 650 655 



Ala Phe Gly Glu Asn Phe Thr He Ser Glu He Arg Leu Ser Glu Asp 
660 665 670 



Leu Leu Ser Pro Glu Leu He Asn Ser Asp Ala Trp Val Gly Ser Gin 
675 680 685 



Gly Thr Trp He Ser Gly Asn Ser Leu Thr He Asn Ser Asn Val Asn 
690 695 700 



Gly Thr Phe Arg Gin Asn Leu Ser Leu Glu Ser Tyr Ser Thr Tyr Ser 
705 710 715 720 



Met Asn Phe Asn Val Asn Gly Phe Ala Lys Val Thr Val Arg Asn Ser 
725 730 735 



Arg Glu Val Leu Phe Glu Lys Asn Tyr Pro Gin Leu ser Pro Lys Asp 
740 745 750 



He Ser Glu Lys Phe Thr Thr Ala Ala Asn Asn Thr Gly Leu Tyr Val 
755 760 765 



Glu Leu Ser Arg Phe Thr Ser Gly Gly Ala He Asn Phe Arg Asn Phe 
770 775 780 



Ser He Lys 
785 



<210> 8 

<211> 2407 

<212> DMA 

<213> Bacillus thuringiensis 
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<220> 

<221> iaiscj_feature 
<222> (1)..(2406) 

<223> vip3Z native coding sequence. 
<400> 8 

atgaataata ctaagttaaa cgcaagggct ttaccaagtt ttattgatta ttttaatggc 60 

atttatggat ttgccactgg tatcaaagac attatgaaca tgatttttaa aacggataca 120 

ggtggtggta atttaacact agatgaaatt ttaaagaatc aagatttatt aaatcaaatc 180 

tcagataaac tcgatggaat taatggagat ttaggtgatc ttattgcaca aggcaattta 240 

aattcagaac taactaagga attattaaaa attgcgaatg agcagaatct gatgttaaat 300 

aatgttaatg ctcaacttaa ttcaataaat tcaacactta acacctatct gccaaaaatt 360 

acatctatgc taagtgaggt aatgaaacaa aactatgtat taagtctaca aatagaattt 420 

cttagtgaac aattacaaga aatatcagat aaacttgatg ttatcaattt aaatgtatta 480 

attaactcta cattgacaga aattacgcct gcatatcaac gtattaaata tgtaaatgat 540 

aaatttgatg aattgacttc tactgtggaa aaaaatccga aaattaatca agataatttt 600 

actgaagatg ttattgataa tttaactgat ttaactgaac tagcacgaag tgtaacgaga 660 

aatgatatgg atagttttga attttatatt aaaactttcc atgatgtgat gataggaaat 720 

aatttattca gtcgttctgc attaaaaact gcttcagaat taattgctaa ggaaaatata 780 

catactatgg gaagtgaaat tggtaatgtc tacactttta tggttgtttt gacttcctta 840 

caagcaaaag cgttcctaac tttaactgca tgccgtaaat tattaggatt aacagatatc 900 

gattatacac aaattatgaa tgaaaattta aatagagaaa aagaggaatt tcgcttaaat 960 

attcttccta cactttctaa tgatttttct aatcctaatt atacagaaac tttaggaagt 1020 

gatcttgtag atcctattgt tacgttagaa gctgatcctg gttatgcttt aataggtttt 1080 

gagattctca atgatccact tccagtatta aaagtatatc aggcaaagct aaaaccaaat 1140 

tatcaagtcg acaaagagtc gattatggaa aatatttatg gaaatatcca caaactactt 1200 

tgtccaaaac aacgtcacca aaaatattat ataaaagaca ttacatttcc tgaaggttat 1260 

gtaatcacca aaattgtttt tgaaaaaaaa ttgaatctat taggatatga agtaacagca 1320 

aatctttatg acccatttac aggaagtatc gatttgaata agactattct agaatcatgg 1380 

aaggaagaat gctgtgaaga agaatgctgt gaagaagaat gctgtgaaga agaatgctgt 1440 

gaagaattat ataaaattat agaggcggat actaacggtg tttatatgcc gttgggagta 1500 

attagtgaaa catttttaac accaatctat agttttaaac taattattga cgaaagaaca 1560 

aagagaatat ctttagcggg taaatcttat ttacgtgaat ctttactagc cacagattta 1620 

gttaataaag atacgaattt aattccttca cccaatggtt tcattaacag tattgtggaa 1680 

aattggaata taacatcgga taatatagag ccctggaaag cgaataataa aaatgcatat 1740 

gtcgataaga cggatgacat ggtgggattt aactctttat atactcataa ggatggggaa 1800 
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ttcttgcaat 
gtaaaaggga 
acaaataaat 
tccgaaatat 
attatactag 
atgccttttg 
actatgacgc 
aaaggaaaag 
aacgaagagg 
ctatttaatg 
gaatagg 



ttattggagc 
gtccggaagt 
ttgatacgtt 
atctagtttt 
aaatcaagtc 
gtaatgctga 
aaaatattca 
ggagagtagc 
tttctacaat 
atgacgtgga 



taagttaaag 
ttatttgaaa 
tcaaactata 
taaaaatcaa 
atttgaaacc 
gattaaagaa 
attagaacag 
gatacaaact 
gattacaact 
gaactccaaa 



gctaaaactg 
aacaataaag 
actaaaaagt 
attggatatg 
ctaccacaaa 
gatggaaaaa 
aattccaagt 
caaagctccc 
agaaacttat 
gttatttttt 



agtatatcat 
gtatctttta 
tcaattcagg 
aagcatgggg 
tattaaaacc 
ttgagatttc 
atcatctaag 
atataaatgt 
acggtgaagg 
cggatgtatc 



tcaatatact 
tgaggataca 
agtagatcca 
aaataaattt 
tgaaaattgg 
aggtaatgga 
attttctgta 
accagctaca 
tatgatatac 
tctagttaaa 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2407 



<210> 9 

<211> 801 

<212> PRT 

<213> Bacillus thuringiensis 



<220> 

<221> MISC_FEATURE 

<222> (1)..(801) 

<223> VipSZ ,toxin 



<400> 9 

Met Asn Asn Thr Lys Leu Asn Ala Arg Ala Leu Pro Ser Phe lie Asp 
1 5 10 15 



Tyr Phe Asn Gly lie Tyr Gly Phe Ala Thr Gly lie Lys Asp He Met 
20 25 30 



Asn Met He Phe Lys Thr Asp Thr Gly Gly Gly Asn Leu Thr Leu Asp 
35 40 45 



Glu He Leu Lys Asn Gin Asp Leu Leu Asn Gin He Ser Asp Lys Leu 
50 55 60 



Asp Gly He Asn Gly Asp Leu Gly Asp Leu He Ala Gin Gly Asn Leu 
65 70 75 80 



Asn Ser Glu Leu Thr Lys Glu Leu Leu liys He Ala Asn Glu Gin Asn 
85 90 95 



Leu Met Leu Asn Asn Val Asn Ala Gin Leu Asn Ser He Asn Ser Thr 
100 105 110 
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Leu Asn Thr Tyr Leu Pro Lys lie Thr Ser Met Leu Ser Glu Val Met 
115 120 125 

Lys Gin Asn Tyr Val Leu Ser Leu Gin lie Glu Phe Leu Ser Glu Gin 
130 135 140 

Leu Gin Glu lie Ser Asp Lys Leu Asp Val lie Asn Leu Asn Val Leu 
145 150 155 160 

He Asn ser Thr Leu Thr Glu He Thr Pro Ala Tyr Gin Arg He Lys 
165 170 175 

Tyr Val Asn T^p Lys Phe Asp Glu Leu Thr Ser Thr Val Glu Lys Asn 
180 185 190 

Pro Lys He Asn Gin Asp Asn Phe Thr Glu Asp Val He Asp Asn Leu 
195 200 205 

Thr Asp Leu Thr Glu Leu Ala Arg Ser Val Thr Arg Asn Asp Met Asp 
210 215 220 

Ser Phe Glu Phe Tyr He Lys Thr Phe His Asp Val Met He Gly Asn 
225 230 235 240 

Asn Leu Phe Ser Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu He Ala 
245 250 255 

Lys Glu Asn He His Thr Met Gly Ser Glu He Gly Asn Val Tyr Thr 
260 265 270 

Phe Met val Val Leu Thr Ser Leu Gin Ala Lys Ala Phe Leu Thr Leu 
275 280 285 

Thr Ala Cys Arg Lys Leu Leu Gly Leu Thr Asp He Asp Tyr Thr Gin 
290 295 300 

He Met Asn Glu Asn Leu Asn Arg Glu Lys Glu Glu Phe Arg Leu Asn 
305 310 315 320 

He Leu Pro Thr Leu Ser Asn Asp Phe Ser Asn Pro Asn Tyr Thr Glu 
325 330 335 

Thr Leu Gly Ser Asp Leu Val Asp Pro He Val Thr Leu Glu Ala Asp 
340 345 350 

Pro Gly Tyr Ala Leu He Gly Phe Glu He Leu Asn Asp Pro Leu Pro 
355 360 365 



Val Leu Lys Val Tyr Gin Ala Lys Leu Lys Pro Asn Tyr Gin Val Asp 
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370 375 380 

Lys GXu Ser lie Met Glu Asn He Tyr Gly Asn He His Lys Leu Leu 
385 390 395 400 

Cys Pro Lys Gin Arg His Gin Lys Tyr Tyr He Lys Asp He Thr Phe 
405 410 415 

Pro Glu Gly Tyr Val He Thr Lys He Val Phe Glu Lys Lys Leu Asn 
420 425 430 , 

Leu Leu Gly Tyr Glu Val Thr Ala Asn Leu Tyr Asp Pro Phe Thr Gly 
435 440 445 

Ser He Asp Leu Asn Lys Thr He Leu Glu Ser Trp Lys Glu Glu Cys 
450 455 460 

Cys Glu Glu Glu Cys Cys Glu Glu Glu Cys Cys Glu Glu Glu Cys Cys 
465 470 475 480 

Glu Glu Leu Tyr Lys He He Glu Ala Asp Thr Asn Gly Val Tyr Met 
485 490 495 

Pro Leu Gly Val He Ser Glu Thr Phe Leu Thr Pro He Tyr Ser Phe 
500 505 510 

Lys Leu He He Asp Glu Arg Thr Lys Arg He Ser Leu Ala Gly Lys 
515 520 525 

Ser Tyr Leu Arg Glu Ser Leu Leu Ala Thr Asp Leu Val Asn Lys Asp 
530 535 540 

Thr Asn Leu He Pro Ser Pro Asn Gly Phe He Asn Ser He Val Glu 
545 550 555 560 

T^n Trp Asn He Thr Ser Asp Asn He Glu Pro Trp Lys Ala Asn Asn 
565 570 575 

Lys Asn Ala Tyr Val Asp Lys Thr Asp Asp Met Val Gly Phe Asn Ser 
580 585 590 

Leu Tyr Thr His Lys Asp Gly Glu Phe Leu Gin Phe He Gly Ala Lys 
595 600 605 

Leu Lys Ala Lys Thr Glu Tyr He He Gin Tyr Thr Val Lys Gly Ser 
610 615 620 

Pro Glu Val Tyr Leu Lys Asn Asn Lys Gly He Phe Tyr Glu Asp Thr 
625 630 635 640 
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Thr Asn Lys Phe Asp Thr Phe Gin Thr lie Thr Lys Lys Phe Asn Ser 
645 650 655 



Gly Val Asp Pro Ser Glu lie Tyr Leu Val Phe Lys Asn Gin lie Gly 
660 665 670 



Tyr Glu Ala Trp Gly Asn Lys Phe lie lie Leu Glu lie Lys Ser Phe 
675 680 685 



Glu Thr Leu Pro Gin lie Leu Lys Pro Glu Asn Trp Met Pro Phe Gly 
690 695 700 



Asn Ala Glu lie Lys Glu Asp Gly Lys He Glu He Ser Gly Asn Gly 
705 710 715 720 



Thr Met Thr Gin Asn He Gin Leu Glu Gin Asn Ser Lys Tyr His Leu 
725 730 735 



Arg Phe Ser Val Lys Gly Lys Gly Arg Val Ala He Gin Thr Gin Ser 
740 745 750 



Ser His He Asn Val Pro Ala Thr Asn Glu Glu Val Ser Thr Met He 
755 760 765 



Thr Thr Arg Asn Leu Tyr Gly Glu Gly Met He Tyr Leu Phe Asn Asp 

770 775 780 

Asp Val Glu Asn Ser Lys Val He Phe Ser Asp Val Ser Leu Val Lys 

785 790 795 800 



Glu 



<210> 10 
<211> 2367 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> vip3A-C Hybrid toxin coding sequence. 
<400> 10 

atgaacaaga ataatactaa attaagcaca agagccttac caagttttat tgattatttt 60 
aatggcattt atggatttgc cactggtatc aaagacatta tgaacatgat ttttaaaacg 120 
gatacaggtg gtgatctaac cctagacgaa attttaaaga atcagcagtt actaaatgat 180 
atttctggta aattggatgg ggtgaatgga agcttaaatg atcttatcgc acagggaaac 240 
ttaaatacag aattatctaa ggaaatatta aaaattgcaa atgaacaaaa tcaagtttta 300 
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aatgatgtta ataacaaact cgatgcgata aatacgatgc ttcgggtata tctacctaaa 360 

attacctcta tgttgagtga tgtaatgaaa caaaattatg cgctaagtct gcaaatagaa 420 

tacttaagta aacaattgca agagatttct gataagttgg atattattaa tgtaaatgta 480 

cttattaact ctacacttac tgaaattaca cctgcgtatc aaaggattaa atatgtgaac 540 

gaaaaatttg aggaattaac ttttgctaca gaaactagtt caaaagtaaa aaaggatggc 600 

tctcctgcag atattcttga tgagttaact gagttaactg aactagcgaa aagtgtaaca 660 

aaaaatgatg tggatggttt tgaattttac cttaatacat tccacgatgt aatggtagga 720 

aataatttat tcgggcgttc agctttaaaa actgcatcgg aattaattac taaagaaaat 780 

gtgaaaacaa gtggcagtga ggtcggaaat gtttataact tcttaattgt attaacagct 840 

ctgcaagcaa aagcttttct tactttaaca acatgccgaa aattattagg cttagcagat 900 

attgattata cttctattat gaatgaacat ttaaataagg aaaaagagga atttagagta 960 

aacatcctcc ctacactttc taatactttt tctaatccta attatgcaaa agttaaagga 1020 

agtgatgaag atgcaaagat gattgtggaa gctaaaccag gacatgcatt gattgggttt 1080 

gaaattagta atgattcaat tacagtatta aaagtatatg aggctaagct aaaacaaaat 1140 

tatcaagtcg ataaggattc cttatcggaa gttatttatg gtgatatgga taaattattg • 1200 

tgcccagatc aatctgaaca aatctattat acaaataaca tagtatttcc aaatgaatat 1260 

gtaattacta aaattgattt cactaaaaaa atgaaaactt taagatatga ggtaacagcg 1320 

aatttttatg attcttctac aggagaaatt gacttaaata agaaaaaagt agaatcaagt 1380 

gaagcggagt atagaacgtt aagtgctaat gatgatgggg tgtatatgcc gttaggtgtc 1440 

atcagtgaaa catttttgac tccgattaat gggtttggcc tccaagctga tgaaaattca 1500 

agattaatta ctttaacatg taaatcatat ttaagagaac tactgctagc aacagactta 1560 

agcaataaag aaactaaatt gatcgtcccg ccaagtggtt ttattagcaa tattgtagag 1620 

aacgggtcca tagaagagga caatttagag ccgtggaaag caaataataa gaatgcgtat 1680 

gtagatcata caggcggagt gaatggaact aaagctttat atgttcataa ggacggagga 1740 

atttcacaat ttattggaga taagttaaaa ccgaaaactg agtatgtaat ccaatatact 1800 

gttaaaggaa aaccttctat tcatttaaaa gatgaaaata ctggatatat tcattatgaa 1860 

gatacaaata ataatttaga agattatcaa actattaata aacgttttac tacaggaact 1920 

gatttaaagg gagtgtattt aattttaaaa agtcaaaatg gagatgaagc ttggggagat 1980 

aaatttacaa ttttagaaat taagcctgcg gaggatttat taagcccaga attaattaat 2040 

ccgaattctt ggattacgac tccaggggct agcatttcag gaaataaact tttcattaac 2100 

ttggggacaa atgggacctt tagacaaagt ctttcattaa acagttattc aacttatagt 2160 

ataagcttta ctgcatcagg accatttaat gtgacggtaa gaaattctag gggagtatta 2220 

tttgaacgaa gcaaccttat gtcttcaact agtcatattt ctgggacatt caaaactgaa 2280 
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tccaataata ccggattata tgtagaactt tcccgtcgct ctggtggtgg tggtcatata 2340 
^ tcatttgaaa acgtttctat taaataa 2367 



<210> 


11 


<211> 


788 


<212> 


PRT 


<213> 


Artificial sequence 


<220> 




<223> 


Hybrid Vip3A-C toxin 


<400> 


11 



Met Asn Lys Asn Asn Thr Lys Leu Ser Thr Arg Ala Leu Pro Ser Phe 
15 10 15 



lie Asp Tyr Phe Asn Gly lie Tyr Gly Phe Ala Thr Gly He Lys Asp 
20 25 30 



He Met Asn Met He Phe Lys Thr Asp Thr Gly Gly Asp Leu Thr Leu 
35 40 45 



Asp Glu He Leu Lys Asn Gin Gin Leu Leu Asn Asp He Ser Gly Lys 
50 55 60 



Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu He Ala Gin Gly Asn 
65 70 75 80 



Leu Asn Thr Glu Leu Ser Lys Glu He Leu Lys He Ala Asn Glu Gin 
85 90 95 



Asn Gin Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala He Asn Thr 
100 105 110 



Met Leu Arg Val Tyr Leu Pro Lys He Thr Ser Met Leu Ser Asp Val 
115 120 125 



Met Lys Gin Asn Tyr Ala Leu Ser Leu Gin He Glu Tyr Leu Ser Lys 
130 135 140 



Gin Leu Gin Glu He Ser Asp Lys Leu Asp He He Asn Val Asn Val 
145 150 155 160 



Leu He Asn Ser Thr Leu Thr Glu He Thr Pro Ala Tyr Gin Arg He 
165- 170 175 



Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr Glu Thr 
180 185 190 



Ser Ser Lys Val Lys Lys Asp Gly Ser Pro Ala Asp He Leu Asp Glu 
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195 200 205 



Leu Thr 61u Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val 
210 215 220 



Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met Val Gly 
225 230 235 240 



Asn Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu lie 
245 250 255 



Thr Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly T^n Val Tyr 
260 265 270 



Asn Phe Leu lie Val Leu Thr Ala Leu Gin Ala Lys Ala Phe Leu Thr 
275 280 285 



Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala Asp lie Asp Tyr Thr 
290 295 300 



Ser lie Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Arg Val 
305 310 315 320 



Asn lie Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala 
325 330 335 



Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met lie Val Glu Ala Lys 
340 345 350 



Pro Gly His Ala Leu lie Gly Phe Glu lie Ser Asn Asp Ser lie Thr 
355 360 365 



Val Leu Lys Val Tyr Glu Ala Lys Leu Lys Gin Asn Tyr Gin Val Asp 
370 375 380 



Lys Asp Ser Leu Ser Glu Val He Tyr Gly Asp Met Asp Lys Leu Leu 
385 390 395 400 



Cys Pro Asp Gin Ser Glu Gin He Tyr Tyr Thr Asn Asn He Val Phe 
405 410 415 



Pro Asn Glu Tyr Val He Thr Lys He Asp Phe Thr Lys Lys Met Lys 
420 425 430 



Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly 
435 440 445 



Glu He Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala Glu Tyr 
450 455 460 

24 



wo 03/075655 PCT/US03/04735 



Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val 
465 470 475 480 



lie Ser Glu Thr Phe Leu Thr Pro lie Asn Gly Phe Gly Leu Gin Ala 
485 490 495 



Asp Glu Asn Ser Arg Leu lie Thr Leu Thr Cys Lys Ser Tyr Leu Arg 
500 505 510 



Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu lie 
515 520 525 



Val Pro Pro Ser Gly Phe lie Ser Asn lie Val Glu Asn Gly Ser lie 
530 535 540 



Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala Tyr 
545 550 555 560 



Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr Val His 
565 570 575 



Lys Asp Gly Gly lie Ser Gin Phe He Gly Asp Lys Leu Lys Pro Lys 
580 585 590 



Thr Glu Tyr Val He Gin Tyr Thr Val Lys Gly Lys Pro Ser He His 
595 , 600 605 



Leu Lys Asp Glu Asn Thr Gly Tyr He His Tyr Glu Asp Thr Asn Asn 
610 615 620 



Asn Leu Glu Asp Tyr Gin Thr He Asn Lys Arg Phe Thr Thr Gly Thr 
625 630 635 640 



Asp Leu Lys Gly Val Tyr Leu He Leu Lys Ser Gin Asn Gly Asp Glu 
645 650 655 



Ala Trp Gly Asp Lys Phe Thr He Leu Glu He Lys Pro Ala Glu Asp 
660 665 670 



Leu Leu Ser Pro Glu Leu He Asn Pro Asn Ser Trp He Thr Thr Pro 
675 680 685 



Gly Ala Ser He Ser Gly Asn Lys Leu Phe He ?jsn Leu Gly Thr Asn 
690 695 700 



Gly Thr Phe Arg Gin Ser Leu Ser Leu Asn Ser Tyr Ser Thr Tyr Ser 
705 710 715 720 
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He Ser Phe Thr Ala Ser Gly Pro Phe Asn Val Thr Val Arg Asn Ser 
725 730 735 



Arg Gly Val Leu Phe Glu Arg Ser Asn Leu Met Ser Ser Thr Ser His 
740 745 750 



He Ser Gly Thr Phe Lys Thr Glu Ser Asn Asn Thr Gly Leu Tyr Val 
755 760 765 



Glu Leu Ser Arg Arg Ser Gly Gly Gly Gly His He Ser Phe Glu Asn 
770 775 780 



Val Ser He Lys 



785 




<210> 


12 


<211> 


36 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


IF Forward primer 


<400> 


12 



atgaacaaga ataatactaa attaagcaca agagcc 36 



<210> 


13 


<211> 


35 


<212> 


DMA 


<213> 


Artificial Sequence 


<220> 




<223> 


IR reverse primer 


<400> 


13 



ctcaacatag aggtaatttt aggtagatat acccg 35 



<210> 


14 


<211> 


25 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Primer P3 


<400> 


14 


gatgatgggg tgtatatgcc gttag 


<210> 


15 


<211> 


25 


<212> 


DNA 


<213> 


Artificial Sequence 



25 



<220> 

<223> Primer P4 
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<400> 15 

aataaattgt gaaattcctc cgtcc 



25 



<210> 16 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 4F 

<400> 16 

agtcaaaatg gagatcaagg ttggggagat aac 33 

<210> 17 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 4R 



<210> 18 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer P5 

<400> 18 

aatggagatg aagcttgggg aga 23 



<210> 19 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer P6 



<210> 20 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer Vip3CF4 

<400> 20 

gtttagaaga ttttcaaacc attac 25 



<210> 21 



<400> 17 

ttacttaata gagagatcgt ggaaatgtac aata 



34 



<400> 19 

cgtggaaatg tacaatagga ccacc 



25 
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<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer T7 

<400> 21 

ttaatacgac tcactatagg g 21 



<210> 22 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer Vip3Cc 



<210> 23 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer Vip3Cn 

<400> 23 

caccatgaac aagaataata ctaaattaag cacaagag 38 



<210> 24 

<211> 38 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer Vip3A-N 

<400> 24 

caccatgaac aagaataata ctaaattaag cacaagag 38 



<210> 25 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer Vip3A2050 

<400> 25 

taaagttatc tccccaagct tcatctcca 29 



<210> 26 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<400> 22 

tttatttaat agaaacgttt tcaaatgata tatg 



34 



<220> 



28 
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<223> Vip3C-Cl 



<400> 26 

aatggagatg aagcttgggg agat 



24 



<210> 27 

<211> 34 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Vip3C-C2 

<400> 27 

tttatttaat agaaacgttt tcaaatgata tatg 34 



<210> 28 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer Vip3Za 



<210> 29 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer Vip3Zb 

<400> 29 

tcctttgata cgcaggtgta atttcag 27 



<210> 30 

<211> 13829 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PNOV2149 



attgcatgtc 


taagttataa 


aaaattacca catatttttt ttgtcacact tgtttgaagt 


120 


gcagtttatc 


tatctttata 


catatattta aactttactc tacgaataat ataatctata 


180 


gtactacaat 


aatatcagtg 


ttttagagaa tcatataaat gaacagttag acatggtcta 


240 


aaggacaatt 


gagtattttg 


acaacaggac tctacagttt tatcttttta gtgtgcatgt 


300 


gttctccttt 


ttttttgcaa 


atagcttcac ctatataata cttcatccat tttattagta 


360 


catccattta 


gggtttaggg 


ttaatggttt ttatagacta atttttttag tacatctatt 


420 



<400> '28 

ggcatttatg gatttgccac tggtatc 



27 



<400> 30 



aagcttgcat gcctgcagtg cagcgtgacc cggtcgtgcc cctctctaga gataatgagc 



60 



29 



wo 03/075655 



PCT/US03/04735 



ttattctatt 


ttagcctcta 


aattaagaaa 


actaaaactc 


tattttagtt 


tttttattta 


480 


ataatttaga 


tataaaatag 


aataaaataa 


agtgactaaa 


aattaaacaa 


atacccttta 


540 


agaaattaaa 


aaaactaagg 


aaacattttt 


cttgtttcga 


gtagataatg 


ccagcctgtt 


600 


aaacgccgtc 


gacgagtcta 


acggacacca 


accagcgaac 


cagcagcgtc 


gcgtcgggcc 


660 


aagcgaagca 


gacggcacgg 


catctctgtc 


gctgcctctg 


gacccctctc 


gagagttccg 


720 


ctccaccgtt 


ggacttgctc 


cgctgtcggc 


atccagaaat 


tgcgtggcgg 


agcggcagac 


780 


gtgagccggc 


acggcaggcg 


gcctcctcct 


cctctcacgg 


caccggcagc 


tacgggggat 


840 


tcctttccca 


ccgctccttc 


gctttccctt 


cctcgcccgc 


cgtaataaat 


agacaccccc 


900 


tccacaccct 


ctttccccaa 


cctcgtgttg 


ttcggagcgc 


acacacacac 


aaccagatct 


960 


cccccaaatc 


cacccgtcgg 


cacctccgct 


tcaaggtacg 


ccgctcgtcc 


tccccccccc 


1020 


cccctctcta 


ccttctctag 


atcggcgttc 


cggtccatgg 


ttagggcccg 


gtagttctac 


1080 


ttctgttcat 


gtttgtgtta 


gatccgtgtt 


tgtgttagat 


ccgtgctgct 


agcgttcgta 


1140 


cacggatgcg 


acctgtacgt 


cagacacgtt 


ctgattgcta 


acttgccagt 


gtttctcttt 


1200 


ggggaatcct 


gggatggctc 


tagccgttcc 


gcagacggga 


tcgatttcat 


gatttttttt 


1260 


gtttcgttgc 


atagggtttg 


gtttgccctt 


ttcctttatt 


tcaatatatg 


ccgtgcactt 


1320 


gtttgtcggg 


tcatcttttc 


atgctttttt 


ttgtcttggt 


tgtgatgatg 


tggtctggtt 


1380 


gggcggtcgt 


tctagatcgg 


agtagaattc 


tgtttcaaac 


tacctggtgg 


atttattaat 


1440 


tttggatctg 


tatgtgtgtg 


ccatacatat 


tcatagttac 


gaattgaaga 


tgatggatgg 


1500 


aaatatcgat 


ctaggatagg 


tatacatgtt 


gatgcgggtt 


ttactgatgc 


atatacagag 


1560 


atgctttttg 


ttcgcttggt 


tgtgatgatg 


tggtgtggtt 


gggcggtcgt 


tcattcgttc 


1620 


tagatcggag 


tagaatactg 


tttcaaacta 


cctggtgtat 


ttattaattt 


tggaactgta 


1680 


tgtgtgtgtc 


atacatcttc 


atagttacga 


gtttaagatg 


gatggaaata 


tcgatctagg 


1740 


ataggtatac 


atgttgatgt 


gggttttact 


gatgcatata 


catgatggca 


tatgcagcat 


1800 


ctattcatat 


gctctaacct 


tgagtaccta 


tctattataa 


taaacaagta 


tgttttataa 


1860 


ttattttgat 


cttgatatac 


ttggatgatg 


gcatatgcag 


cagctatatg 


tggatttttt 


1920 


tagccctgcc 


ttcatacgct 


atttatttgc 


ttggtactgt 


ttcttttgtc 


gatgctcacc 


1980 


ctgttgtttg 


gtgttacttc 


tgcagggatc 


caccatgaac 


aagaacaaca 


ccaagctctc 


2040 


cacccgcgcc 


ctcccgtcct 


tcatcgacta 


cttcaacggc 


atctacggct 


tcgccaccgg 


2100 


catcaaggac 


atcatgaaca 


tgatcttcaa 


gaccgacacc 


ggcggcaacc 


tcaccctcga 


2160 


cgagatcctc 


aagaaccagc 


agctcctcaa 


cgagatcagc 


ggcaagctcg 


acggcgtgaa 


2220 


cggctccctc 


aacgacctca 


tcgcccaggg 


caacctcaac 


accgagctgt 


ccaaggagat 


2280 


cctcaagatc 


gccaacgagc 


agaaccaggt 


gctcaacgac 


gtgaacaaca 


agctcgacgc 


2340 


catcaacacc 


atgctccaca 


tctacctccc 


gaagatcacc 


tccatgctct 


ccgacgtgat 


2400 
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gaagcagaac 


tacgccctct 


ccctccagat 


cgagtacctc 


tccaagcagc 


tccaggagat 


2460 


cagcgacaag 


ctcgacatca 


tcaacgtgaa 


cgtgctcatc 


aactccaccc 


tcaccgagat 


2520 


caccccggcc 


taccagcgca 


tcaagtacgt 


gaacgagaag 


ttcgaggagc 


tgaccttcgc 


2580 


caccgagacc 


accctcaagg 


tgaagaagga 


ctcctccccg 


gccgacatcc 


tcgacgagct 


2640 


gaccgagctg 


accgagctgg 


ccaagtccgt 


gaccaagaac 


gacgtggacg 


gccucgagcc 


2700 


ctacctcaac 


accttccacg 


acgtgatggt 


gggcaacaac 


ctcttcggcc 


gctccgccct 


2760 


caagaccgcc 


tccgagctga 


tcgccaagga 


gaacgtgaag 


acctccggct 


ccgaggtggg 


2820 


caacgtgtac 


aacttcctca 


tcgtgctcac 


cgccctgcag 


gccaaggcct 


tcctcaccct 


2880 


caccacctgc 


cgcaagctcc 


tcggcctcgc 


cggcatcgac 


tacacctcca 


tcatgaacga 


2940 


gcacctcaac 


aaggagaagg 


aggagttccg 


cgtgaacatc 


ctcccgaccc 


tctccaacac 


3000 


cttctccaac 


ccgaactacg 


ccaaggtgaa 


gggctccgac 


gaggacgcca 


agatgatcgt 


3060 


ggaggccaag 


ccgggccacg 


ccctcgtggg 


cttcgagatg 


tccaacgact 


ccatcaccgt 


3120 


gctcaaggtg 


tacgaggcca 


agctcaagca 


gaactaccag 


gtggacaagg 


actccctctc 


3180 


cgaggtgatc 


tacggcgaca 


ccgacaagct 


cttctgcccg 


gaccagtccg 


agcagatata 


3240 


ctacaccaac 


aacatcgtgt 


tcccgaacga 


gtacgtgatc 


accaagatcg 


acttcaccaa 


3300 


gaagatgaag 


accctccgct 


acgaggtgac 


cgccaacttc 


tacgactcct 


ccaccggcga 


3360 


gatcgacctc 


aacaagaaga 


aggtggagtc 


ctccgaggcc 


gagtaccgca 


ccctctccgc 


3420 


caacgacgac 


ggcgtgtaca 


tgccgctcgg 


cgtgatctcc 


gaaaccttcc 


tcaccccgat 


3480 


caacggcttc 


ggcctccagg 


ccgacgagaa 


ctcccgcctc 


atcaccctca 


cctgcaagtc 


3540 


ctacctccgc 


gagctgctcc 


tcgccaccga 


cctctccaac 


aaggagacca 


agctcatcgt 


3600 


gccgccgtcc 


ggcttcatct 


ccaacatcgt 


ggagaacggc 


tccatcgagg 


aggacaacct 


3660 


cgagccgtgg 


aaggccaaca 


acaagaacgc 


ctacgtggac 


cacaccggcg 


gcgtgaacgg 




caccaaggcc 


ctctacgtgc 


acaaggacgg 


cggcttctcc 


cagttcatcg 


gcgacaagct 


3780 


caagccgaag 


accgagtacg 


tgatccagta 


caccgtgaag 


ggcaagccgt 


ccatccacct 


3840 


caaggacgag 


aacaccggct 


acatccacta 


cgaggacacc 


aacaacaacc 


tcaaggacta 


3900 


ccagaccatc 


accaagcgct 


tcaccaccgg 


caccgacctc 


aagggcgtgt 


acctcatcct 


3960 


caagtcccag 


aacggcgacg 


aggcctgggg 


cgacaagttc 


accatccttg 


agatcaagcc 


4020 


ggccgaggac 


ctcctctccc 


cggagctgat 


caacccgaac 


tcctggatca 


ccaccccggg 


4080 


cgcctccatc 


tccggcaaca 


agctcttcat 


caacctcggc 


accaacggca 


ccttccgcca 


4140 


gtccctctcc 


ctcaactcct 


actccaccta 


ctccatctcc 


ttcaccgcct 


ccggcccgtt 


4200 


caacgtgacc 


gtgcgcaact 


cccgcagggt 


gctcttcgag 


cgctccaacc 


tcatgtcctc 


4260 


cacctcccac 


atctccggca 


ccttcaagac 


cgagtccaac 


aacaccggcc 


tctacgtgga 


4320 
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4800 
4860 



gctgtcccgc cgctccggcg gcggcggcca catctccttc gagaacgtgt ccatcaagta 4380 

gatctgagct ctagatcccc gaatttcccc gatcgttcaa acatttggca ataaagtttc 4440 

ttaagattga atcctgttgc cggtcttgcg atgattatca tataatttct gttgaattac 4500 

gttaagcatg taataattaa catgtaatgc atgacgttat ttatgagatg ggtttttatg 4560 

attagagtcc cgcaattata catttaatac gcgatagaaa acaaaatata gcgcgcaaac 4620 

taggataaat tatcgcgcgc ggtgtcatct atgttactag atcgggaatt gggtaccagc 4680 

ttgcatgcct gcagtgcagc gtgacccggt cgtgcccctc tctagagata atgagcattg 4740 

catgtctaag ttataaaaaa ttaccacata ttttttttgt cacacttgtt tgaagtgcag 

tttatctatc tttatacata tatttaaact ttactctacg aataatataa tctatagtac 

tacaataata tcagtgtttt agagaatcat ataaatgaac agttagacat ggtctaaagg 4920 

acaattgagt attttgacaa caggactcta cagttttatc tttttagtgt gcatgtgttc 4980 

tccttttttt ttgcaaatag cttcacctat ataatacttc atccatttta ttagtacatc 5040 

catttagggt ttagggttaa tggtttttat agactaattt ttttagtaca tctattttat 5100 

tctattttag cctctaaatt aagaaaacta aaactctatt ttagtttttt tatttaataa 5160 

tttagatata aaatagaata aaataaagtg actaaaaatt aaacaaatac cctttaagaa 5220 

attaaaaaaa ctaaggaaac atttttcttg tttcgagtag ataatgccag cctgttaaac 5280 

gccgtcgacg agtctaacgg acaccaacca gcgaaccagc agcgtcgcgt cgggccaagc 5340 

gaagcagacg gcacggcatc tctgtcgctg cctctggacc cctctcgaga gttccgctcc 5400 

accgttggac ttgctccgct gtcggcatcc agaaattgcg tggcggagcg gcagacgtga 5460 

gccggcacgg caggcggcct cctcctcctc tcacggcacc ggcagctacg ggggattcct 5520 

ttcccaccgc tccttcgctt tcccttcctc gcccgccgta ataaatagac accccctcca 5580 

caccctcttt ccccaacctc gtgttgttcg gagcgcacac acacacaacc agatctcccc 5640 

caaatccacc cgtcggcacc tccgcttcaa ggtacgccgc tcgtcctccc cccccccccc 5700 

tctctacctt ctctagatcg gcgttccggt ccatggttag ggcccggtag ttctacttct 5760 

gttcatgttt gtgttagatc cgtgtttgtg ttagatccgt gctgctagcg ttcgtacacg 58?0 

gatgcgacct gtacgtcaga cacgttctga ttgctaactt gccagtgttt ctctttgggg 5880 

aatcctggga tggctctagc cgttccgcag acgggatcga tttcatgatt ttttttgttt 5940 

cgttgcatag ggtttggttt gcccttttcc tttatttcaa tatatgccgt gcacttgttt 6000 

gtcgggtcat cttttcatgc ttttttttgt cttggttgtg atgatgtggt ctggttgggc 6060 

ggtcgttcta gatcggagta gaattctgtt tcaaactacc tggtggattt attaattttg 6120 

gatctgtatg tgtgtgccat acatattcat agttacgaat tgaagatgat ggatggaaat 6180 

atcgatctag gataggtata catgttgatg cgggttttac tgatgcatat acagagatgc 6240 

tttttgttcg cttggttgtg atgatgtggt gtggttgggc ggtcgttcat tcgttctaga 6300 
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tcggagtaga 


atactgtttc 


aaactacctg 


gtgtatttat 


taattttgga actgtatgtg 


6360 


tgtgtcatac 


atcttcatag 


ttacgagttt 


aagatggatg 


gaaatatcga tctaggatag 


6420 


gtatacatgt 


tgatgtgggt 


tttactgatg 


catatacatg 


atggcatatg cagcatctat 


6480 


tcatatgctc 


taaccttgag 


tacctatcta 


ttataataaa 


caagtatgtt ttataattat 


6540 


tttgatcttg 


atatacttgg 


atgatggcat 


atgcagcagc 


tatatgtgga tttttttagc 


6600 


cctgccttca 


tacgctattt 


atttgcttgg 


tactgtttct 


tttgtcgatg ctcaccctgt 


6660 


tgtttggtgt 


tacttctgca 


gggatccccg 


atcatgcaaa 


aactcattaa ctcagtgcaa 


6720 


aactatgcct 


ggggcagcaa 


aacggcgttg 


actgaacttt 


atggtatgga aaatccgtcc 


6780 


agccagccga 


tggccgagct 


gtggatgggc 


gcacatccga 


aaagcagttc acgagtgcag 


6840 


aatgccgccg 


gagatatcgt 


ttcactgcgt 


gatgtgattg 


agagtgataa atcgactctg 


6900 


ctcggagagg 


ccgttgccaa 


acgctttggc 


gaactgcctt 


tcctgttcaa agtattatgc 


6960 


gcagcacagc 


cactctccat 


tcaggttcat 


ccaaacaaac 


acaattctga aatcggtttt 


7020 


gccaaagaaa 


atgccgcagg 


tatcccgatg 


gatgccgccg 


agcgtaacta taaagatcct 


7080 


aaccacaagc 


cggagctggt 


ttttgcgctg 


acgcctttcc 


ttgcgatgaa cgcgtttcgt 


7140 


gaattttccg 


agattgtctc 


cctactccag 


ccggtcgcag 


gtgcacatcc ggcgattgct 


7200 


cactttttac 


aacagcctga 


tgccgaacgt 


ttaagcgaac 


tgttcgccag cctgttgaat 


7260 


atgcagggtg 


aagaaaaatc 


ccgcgcgctg 


gcgattttaa 


aatcggccct cgatagccag 


7320 


cagggtgaac 


cgtggcaaac 


gattcgttta 


atttctgaat 


tttacccgga agacagcggt 


7380 


ctgttctccc 


cgctattgct 


gaatgtggtg 


aaattgaacc 


ctggcgaagc gatgttcctg 


7440 


ttcgctgaaa 


caccgcacgc 


ttacctgcaa 


ggcgtggcgc 


tggaagtgat ggcaaactcc 


7500 


gataacgtgc 


tgcgtgcggg 


tctgacgcct 


aaatacattg 


atattccgga actggttgcc 


7560 


aatgtgaaat 


tcgaagccaa 


accggctaac 


cagttgttga 


cccagccggt gaaacaaggt 


7620 


gcagaactgg 


acttcccgat 


tccagtggat 


gattttgcct 


tctcgctgca tgaccttagt 


7680 


gataaagaaa 


ccaccattag 


ccagcagagt 


gccgccattt 


tgttctgcgt cgaaggcgat 


7740 


gcaacgttgt 


ggaaaggttc 


tcagcagtta 


cagcttaaac 


cgggtgaatc agcgtttatt 


7800 


gccgccaacg 


aatcaccggt 


gactgtcaaa 


ggccacggcc 


gtttagcgcg tgtttacaac 


7860 


aagctgtaag 


agcttactga 


aaaaattaac 


atctcttgct 


aagctgggag ctcgatccgt 


7920 


cgacctgcag 


atcgttcaaa 


catttggcaa 


taaagtttct 


taagattgaa tcctgttgcc 


7980 


ggtcttgcga 


tgattatcat 


ataatttctg 


ttgaattacg 


ttaagcatgt aataattaac 


8040 


atgtaatgca 


tgacgttatt 


tatgagatgg 


gtttttatga 


ttagagtccc gcaattatac 


8100 


atttaatacg 


cgatagaaaa 


caaaatatag 


cgcgcaaact 


aggataaatt atcgcgcgcg 


8160 


gtgtcatcta 


tgttactaga 


tctgctagcc 


ctgcaggaaa 


tttaccggtg cccgggcggc 


8220 
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cagcatggcc 


gtatccgcaa 


tgtgttatta 


agttgtctaa 


gcgtcaattt 


gtttacacca 


8280 


caatatatcc 


tgccaccagc 


cagccaacag 


ctccccgacc 


ggcagctcgg 


cacaaaatca 


8340 


ccactcgata 


caggcagccc 


atcagaatta 


attctcatgt 


ttgacagctt 


atcatcgact 


8400 


gcacggtgca 


ccaatgcttc 


tggcgtcagg 


cagccatcgg 


aagctgtggt 


atggctgtgc 


8460 


aggtcgtaaa 


tcactgcata 


attcgtgtcg 


ctcaaggcgc 


actcccgttc 


tggataatgt 


8520 


tttttgcgcc 


gacatcataa 


cggttctggc 


aaatattctg 


aaatgagctg 


ttgacaatta 


8580 


atcatccggc 


tcgtataatg 


tgtggaattg 


tgagcggata 


acaatttcac 


acaggaaaca 


8640 


gaccatgagg 


gaagcgttga 


tcgccgaagt 


atcgactcaa 


ctatcagagg 


tagttggcgt 


8700 


catcgagcgc 


catctcgaac 


cgacgttgct 


ggccgtacat 


ttgtacggct 


ccgcagtgga 


8760 


tggcggcctg 


aagccacaca 


gtgatattga 


tttgctggtt 


acggtgaccg 


taaggcttga 


8820 


tgaaacaacg 


cggcgagctt 


tgatcaacga 


ccttttggaa 


acttcggctt 


cccctggaga 


8880 


gagcgagatt 


ctccgcgctg 


tagaagtcac 


cattgttgtg 


cacgacgaca 


tcattccgtg 


8940 


gcgttatcca 


gctaagcgcg 


aactgcaatt 


tggagaatgg 


cagcgcaatg 


acattcttgc 


9000 


aggtatcttc 


gagccagcca 


cgatcgacat 


tgatctggct 


atcttgctga 


caaaagcaag 


9060 


agaacatagc 


gttgccttgg 


taggtccagc 


ggcggaggaa 


ctctttgatc 


cggttcctga 


9120 


acaggatcta 


tttgaggcgc 


taaatgaaac 


cttaacgcta 


tggaactcgc 


cgcccgactg 


9180 


ggctggcgat 


gagcgaaatg 


tagtgcttac 


gttgtcccgc 


atttggtaca 


gcgcagtaac 


9240 


cggcaaaatc 


gcgccgaagg 


atgtcgctgc 


cgactgggca 


atggagcgcc 


tgccggccca 


9300 


gtatcagccc 


gtcatacttg 


aagctaggca 


ggcttatctt 


ggacaagaag 


atcgcttggc 


9360 


ctcgcgcgca 


gatcagttgg 


aagaatttgt 


tcactacgtg 


aaaggcgaga 


tcaccaaagt 


9420 


agtcggcaaa 


taaagctcta 


gtggatctcc 


gtacccccgg 


gggatctggc 


tcgcggcgga 


9480 


cgcacgacgc 


cggggcgaga 


ccataggcga 


tctcctaaat 


caatagtagc 


tgtaacctcg 


9540 


aagcgtttca 


cttgtaacaa 


cgattgagaa 


tttttgtcat 


aaaattgaaa 


tacttggttc 


9600 


gcatttttgt 


catccgcggt 


cagccgcaat 


tctgacgaac 


tgcccattta 


gctggagatg 


9660 


attgtacatc 


cttcacgtga 


aaatttctca 


agcgctgtga 


acaagggttc 


agattttaga 


9720 


ttgaaaggtg 


agccgttgaa 


acacgttctt 


cttgtcgatg 


acgacgtcgc 


tatgcggcat 


9780 


cttattattg 


aataccttac 


gatccacgcc 


ttcaaagtga 


ccgcggtagc 


cgacagcacc 


9840 


cagttcacaa 


gagtactctc 


ttccgcgacg 


gtcgatgtcg 


tggttgttga 


tctaaattta 


9900 


ggtcgtgaag 


atgggctcga 


gatcgttcgt 


aatctggcgg 


caaagtctga 


tattccaatc 


9960 


ataattatca 


gtggcgaccg 


ccttgaggag 


acggataaag 


ttgttgcact 


cgagctagga 


10020 


gcaagtgatt 


ttatcgctaa 


gccgttcagt 


atcagagagt 


ttctagcacg 


cattcgggtt 


10080 


gccttgcgcg 


tgcgccccaa 


cgttgtccgc 


tccaaagacc 


gacggtcttt 


ttgttttact 


10140 


gactggacac 


ttaatctcag 


gcaacgtcgc 


ttgatgtccg 


aagctggcgg 


tgaggtgaaa 


10200 
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gcgu&^u&ag 


agaaaccccg 


cgacgcuCua 




U C ^ C ^ C d u> 




ugccagucga 


guacgcgacg 


aggaggubua 


ugacaggagti. 


1 n^on 




♦* r'a ^ ^ ^ +* rra rr 

u U U ^7 




& s n 4" MS f^t^ 






X vODli 


i* 9 a a A a 


o a rro A a rra rr/r 
ci(^ wci eisj asj %j 




^^^^^4'/V9^rv 

u^cu^ugacg 




MM^* 4" MMM 9 M 

t>u ucgcac 


XU44U 




^ a ^ ^ #v 

i>gy^cagcctg 


agccaa uucc 


Cagauccccg 


aggaaucggc 


gugagcgguc 


xuouu 


9caa.accdtic 


cggcccggta 


caaalicggcg 


cggcgctggg 


tgatigaccl^g 


guggagaagu 






gcaggccgcc 


cagcggcaac 


gcalscgaggc 


agaagcacgc 


cccggtgaa'k 


XUb^U 


cgtggcaagc 


ggccgctgat 


cgaatccgca 


aagaatcccg 


gcaaccgccg 


gcagccggtig 


1 neon 


cgccgtcgat 


taggaagccg 


cccaagggcg 


acgagcaacc 


agatttutcc 


gttccgatgc 


1074Q 


tctatgacgt 


gggcacccgc 


gatagtcgca 


gca'tcatgga 


cgtggccgtt 


ucccgcccgc 




cgaagcgtga 


ccgacgagct 


ggcgaggtga 


tccgctacga 


gcttccagac 


gggcacgtag 


1 noen 
IDooO 


aggtttccgc 


agggccggcc 


ggcatggcca 


gtgtgtggga 


ttacgacctg 


gtactgatgg 


1 nA4A 
10920 


cggtttccca 


tctaaccgaa 


tccatgaacc 


gataccggga 


agggaaggga 


gacaagcccg 


10980 


gccgcgtgtt 


ccgtccacac 


gttgcggacg 


tactcaagtt 


ctgccggcga 


gccgatggcg 


11040 


gaaagcagaa 


agacgacctg 


gtagaaacct 


gcattcggtt 


aaacaccacg 


cacgtbgcca 


til An 
11100 


tgcagcgtac 


gaagaaggcc 


aagaacggcc 


gcctggtgac 


ggtatccgag 


ggtgaagcct 


111 cn 
llloU 


tgattagccg 


ctacaagatc 


gtaaagagcg 


aaaccgggcg 


gccggag'tac 


atcgagatcg 


llz^U 


agctagctga 


ttggatguac 


cgcgagatca 


cagaaggcaa 


gaacccggac 


gt.gctgacgg 


llZoU 


ttcaccccga 


ttactttttg 


atcgatcccg 


gcat:cggccg 


ttttctcrac 


cgcctggcac 


llo4U 


gccgcgccgc 


aggcaaggca 


gaagccagat 


ggtitigtitcaa 


gacga'tc'tac 


gaacgcagtg 


11/1 r*n 


gcagcgccgg 


agagttcaag 


aagttctgtt 


tcaccgtgcg 


caagctgatc 


gggt.caaa^g 


1 1 yi cn 


acctgccgga 


gtacgatttg 


aaggaggagg 


cggggcaggc 


tggcccgatc 


ctagtcatgc 




gctaccgcaa 


cctgatcgag 


ggcgaagcat 


ccgccggttc 


ctaatgtacg 


gagcagatgc 


1 T c Q n 


tagggcaaat 


tgccctagca 


ggggaaaaag 


gt-cgaaaagg 


tCtCtttCCt 


gtiggatagca 


XX oftu 


cgtacattgg 


gaacccaaag 


ccgtacattg 


ggaaccggaa 


cccgt:acat.t 


gggaacccaa 


XX / uu 


agccgtacat 


tgggaaccgg 


tcacacatgt 


aagtgactga 


tataaaagag 


aaaaaaggcg 


1 1 T cn 
iJL /bU 


atttttccgc 


ctaaaactct 


ttaaaactta 


ttaaaactct 


taaaacccgc 


ctggcctgtg 


11820 


cataactgtc 


tggccagcgc 


acagccgaag 


agctgcaaaa 


agcgcctacc 


cttcggtcgc 


11880 


tgcgctccct 


acgccccgcc 


gcttcgcgtc 


ggcctatcgc 


ggccgctggc 


cgctcaaaaa 


11940 


tggctggcct 


acggccaggc 


aatctaccag 


ggcgcggaca 


agccgcgccg 


tcgccactcg 


12000 


accgccggcg 


ctgaggtctg 


cctcgtgaag 


aaggtgttgc 


tgactcatac 


caggcctgaa 


12060 


tcgccccatc 


atccagccag 


aaagtgaggg 


agccacggtt 


gatgagagct 


ttgttgtagg 


12120 
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tggaccagtt 


ggtgattttg 


aacttttgct 


ttgccacgga acggtctgcg ttgtcgggaa 


12180 


gatgcgtgat 


ctgatccttc 


aactcagcaa 


aagttcgatt tattcaacaa agccgccgtc 


12240 


ccgtcaagtc 


agcgtaatgc 


tctgccagtg 


ttacaaccaa ttaaccaatt ctgattagaa 


12300 


aaactcatcg 


agcatcaaat 


gaaactgcaa 


tttattcata tcaggattat caataccata 


12360 


tttttgaaaa 


agccgtttct 


gtaatgaagg 


agaaaactca ccgaggcagt tccataggat 


12420 


ggcaagatcc 


tggtatcggt 


ctgcgattcc 


gactcgtcca acatcaatac aacctattaa 


12480 


tttcccctcg 


tcaaaaataa 


ggttatcaag 


tgagaaatca ccatgagtga cgactgaatc 


12540 


cggtgagaat 


ggcaaaagct 


ctgcattaat 


gaatcggcca acgcgcgggg agaggcggtt 


12600 


tgcgtattgg 


gcgctcttcc 


gcttcctcgc 


tcactgactc gctgcgctcg gtcgttcggc 


12660 


tgcggcgagc 


ggtatcagct 


cactcaaagg 


cggtaatacg gttatccaca gaatcagggg 


12720 


ataacgcagg 


aaagaacatg 


tgagcaaaag 


gccagcaaaa ggccaggaac cgtaaaaagg 


12780 


ccgcgttgct 


ggcgtttttc 


cataggctcc 


gcccccctga cgagcatcac aaaaatcgac 


il2840 


gctcaagtca 


gaggtggcga 


aacccgacag 


gactataaag ataccaggcg tttccccctg 


12900 


gaagctccct 


cgtgcgctct 


cctgttccga 


ccctgccgct taccggatac ctgtccgcct 


12960 


ttctcccttc 


gggaagcgtg 


gcgctttctc 


atagctcacg ctgtaggtat ctcagttcgg 


13020 


tgtaggtcgt 


tcgctccaag 


ctgggctgtg 


tgcacgaacc ccccgttcag cccgaccgct 


13080 


gcgccttatc 


cggtaactat 


cgtcttgagt 


ccaacccggt aagacacgac ttatcgccac 


13140 


tggcagcagc 


cactggtaac 


aggattagca 


gagcgaggta tgtaggcggt gctacagagt 


13200 


tcttgaagtg 


gtggcctaac 


tacggctaca 


ctagaagaac agtatttggt atctgcgctc 


13260 


tgctgaagcc 


agttaccttc 


ggaaaaagag 


ttggtagctc ttgatccggc aaacaaacca 


13320 


ccgctggtag 


cggtggtttt 


tttgtttgca 


agcagcagat tacgcgcaga aaaaaaggat 


13380 


ctcaagaaga 


tcctttgatc 


ttttctacgg 


ggtctgacgc tcagtggaac gaaaactcac 


13440 


gttaagggat 


tttggtcatg 


agattatcaa 


aaaggatctt cacctagatc cttttgatcc 


13500 


ggaattaatt 


cctgtggttg 


gcatgcacat 


acaaatggac gaacggataa accttttcac 


13560 


gcccttttaa 


atatccgatt 


attctaataa 


acgctctttt ctcttaggtt tacccgccaa 


13620 


tatatcctgt 


caaacactga 


tagtttaaac 


tgaaggcggg aaacgacaat ctgatcatga 


13680 


gcggagaatt 


aagggagtca 


cgttatgacc 


cccgccgatg acgcgggaca agccgtttta 


13740 


cgtttggaac 


tgacagaacc 


gcaacgctgc 


aggaattggc cgcagcggcc atttaaatca 


13800 


attgggcgcg 


ccgaattcga 


gctcggtac 




13829 



<210> 31 

<211> 2367 

<212> DNA 

<213> Bacillus thuringiensis 
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<220> 

<221> inisc_feature 

<222> (1)..(2367) 

<223> Vip3C-12168 coding sequence 



<400> 31 
atgaatatga 


ataatactaa 


attaaacgca 


agggccctac cgagttttat 


tgattatttt 


60 


aatggcattt 


atggatttgc 


cactggtatc 


aaagacatta tgaatatgat 


ttttaaaacg 


120 


gatacaggtg 


gtaatctaac 


cttagacgaa 


atcctaaaga atcagcagtt 


actaaatgag 


180 


atttctggta 


aattggatgg 


ggtaaatggg 


agcttaaatg atcttatcgc 


acagggaaac 


240 


ttaaatacag 


aattatctaa 


ggaaatctta 


aaaattgcaa atgaacagaa 


tcaagtctta 


300 


aatgatgtta 


ataacaaact 


cgatgcgata 


aatacgatgc ttcatatata 


tctacctaaa 


360 


attacatcta 


tgttaagtga 


tgtaatgaag 


caaaattatg cgctaagtct 


gcaaatagaa 


420 


tacttaagta 


agcaattgca 


agaaatttct 


gataaattag atattattaa 


cgtaaatgtt 


480 


cttattaact 


ctacacttac 


tgaaattaca 


cctgcatatc aacggattaa 


atatgtgaat 


540 


gaaaaatttg 


aagaattaac 


ttttgctaca 


gaaaccactt taaaagtaaa 


aaaggatagc 


600 


tcgcctgctg 


atattcttga 


tgagttaact 


gaattaactg aactagcgaa 


aagtgttaca 


660 


aaaaatgacg 


ttgatggttt 


tgaattttac 


cttaatacat tccacgatgt 


aatggtagga 


720 


aataatttat 


tcgggcgttc 


agctttaaaa 


actgcttcag aattaattgc 


taaagaaaat 


780 


gtgaaaacaa 


gtggcagtga 


agtaggaaat 


gtttataatt tcttaattgt 


attaacagct 


840 


ctacaagcaa 


aagcttttct 


tactttaaca 


acatgccgaa aattattagg 


cttagcagat 


90,0 


attgattata 


cttctattat 


gaatgaacat 


ttaaataagg aaaaagagga 


atttagagta 


960 


aacatccttc 


ctacactttc 


taatactttt 


tctaatccta attatgcaaa 


agttaaagga 


1020 


agtgatgaag 


atgcaaagat 


gattgtggaa 


gctaaaccag gacatgcatt 


ggttgggttt 


1080 


gaaatgagca 


atgattcaat 


cacagtatta 


aaagtatatg aggctaagct 


aaaacaaaat 


1140 


tatcaagttg 


ataaggattc 


cttatcggag 


gttatttatg gtgatacgga 


taaattattt 


1200 


tgtccagatc 


aatctgaaca 


aatatattat 


acaaataaca tagtattccc 


aaatgaatat 


1260 


gtaattacta 


aaattgattt 


cactaaaaaa 


atgaaaactt taagatatga 


ggtaacagcg 


1320 


act w u u w ivd 




St cirsst rf 51 a a t* t" 


rracttiaaata acraaaaaaoii 


agaat^caagt. 


1380 


gaagcggagt 


atagaacgtt 


aagtgctaat 


gatgatggag tgtatatgcc 


attaggtgtc 


1440 


atcagtgaaa 


catttttgac 


tccgataaat 


gggtttggcc tccaagctga 


tgaaaattca 


1500 


agattaatta 


ctttaacatg 


taaatcatat 


ttaagagaac tactgctagc 


aacagactta 


1560 


agcaataaag 


aaactaaatt 


gatcgtccca 


ccaagtggtt ttattagcaa 


tattgtagag 


1620 


aacgggtcca 


tagaagagga 


caatttagag 


ccgtggaaag caaataataa 


gaatgcgtat 


1680 


gtagatcata 


caggcggagt 


gaatggaact 


aaagctttat atgttcataa 


ggacggagga 


1740 


ttttcacaat 


ttattggaga 


taagttaaaa 


ccgaaaactg agtatgtaat 
37 
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gttaaaggaa 
gatacaaata 
gatttaaagg 
aaatttacaa 
ccgaattctt 
ttggggacaa 
ataagcttta 
tttgaacgaa 
tccaataata 
tcatttgaaa 



aaccttctat 
ataatttaaa 
gagtgtattt 
ttttagaaat 
ggattacgac 
atgggacctt 
ctgcatcagg 
gcaaccttat 
ccggattata 
acgtttctat 



tcatttaaaa 
agattatcaa 
aattttaaaa 
taagcctgcg 
tccaggggct 
tagacaaagt 
accatttaat 
gtcttcaact 
tgtagaactt 
taaataa 



gatgaaaata 
actattacta 
agtcaaaatg 
gaggatttat 
agcatttcag 
ctttcattaa 
gtgacggtaa 
agtcatattt 
tcccgtcgct 



ctggatatat 
aacgttttac 
gagatgaagc 
taagcccaga 
gaaataaact 
acagttattc 
gaaattctag 
ctgggacatt 
ctggtggtgg 



tcattatgaa 
tacaggaact 
ttggggagat 
attaattaat 
tttcattaac 
aacttatagt 
ggaagtatta 
caaaactgaa 
tggtcatata 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2367 



<210> 32 

<211> 788 

<212> PRT 

<213> Bacillus thuringiensis 



<220> 

<221> MISC_FEATURE 

<222> (1)..(788) 

<223> Vip3C-12168 toxin 



<400> 32 

Met Asn Met Asn Asn Thr Lys Leu Asn Ala Arg Ala Leu Pro Ser Phe 
15 10 15 



He Asp Tyr Phe Asn Gly He Tyr Gly Phe Ala Thr Gly He Lys Asp 
20 25 30 



He Met Asn Met He Phe Lys Thr Asp Thr Gly Gly Asn Leu Thr Leu 
35 40 45 



Asp Glu He Leu Lys Asn Gin Gin Leu Leu Asn Glu He Ser Gly Lys 
50 55 60 



Leu Asp Gly Val Asn Gly Ser Leu Asn Asp Leu He Ala Gin Gly Asn 
65 70 75 80 



Leu Asn Thr Glu Leu Ser Lys Glu He Leu Lys He Ala Asn Glu Gin 
85 90 95 



Asn Gin Val Leu Asn Asp Val Asn Asn Lys Leu Asp Ala He Asn Thr 
100 105 110 



Met Leu His 
115 



He 



Tyr Leu Pro Lys He Thr Ser Met Leu Ser Asp Val 
120 125 
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Met Lys Gin Asn Tyr Ala Leu Ser Leu Gin lie Glu Tyr Leu Ser Lys 
130 135 140 



Gin Leu Gin Glu lie Ser Asp Lys Leu Asp lie lie Asn Val Asn Val 
145 150 155 160 



Leu lie Asn Ser Thr Leu Thr Glu lie Thr Pro Ala Tyr Gin Arg He 
165 170 175 



Lys Tyr Val Asn Glu Lys Phe Glu Glu Leu Thr Phe Ala Thr Glu Thr 
180 185 190 



Thr Leu Lys Val Lys Lys Asp Ser Ser Pro Ala Asp He Leu Asp Glu 
195 200 205 



Leu Thr Glu Leu Thr Glu Leu Ala Lys Ser Val Thr Lys Asn Asp Val 
210 215 220 



Asp Gly Phe Glu Phe Tyr Leu Asn Thr Phe His Asp Val Met Val Gly 
225 230 235 240 



Asn^Asn Leu Phe Gly Arg Ser Ala Leu Lys Thr Ala Ser Glu Leu He 
245 250 255 



Ala Lys Glu Asn Val Lys Thr Ser Gly Ser Glu Val Gly Asn Val Tyr 
260 265 270 



Asn Phe Leu He Val Leu Thr Ala Leu Gin Ala Lys Ala Phe Leu Thr 
275 280 285 



Leu Thr Thr Cys Arg Lys Leu Leu Gly Leu Ala Asp He Asp Tyr Thr 
290 295 300 



Ser He Met Asn Glu His Leu Asn Lys Glu Lys Glu Glu Phe Arg Val 
305 310 315 320 



Asn He Leu Pro Thr Leu Ser Asn Thr Phe Ser Asn Pro Asn Tyr Ala 
325 330 335 



Lys Val Lys Gly Ser Asp Glu Asp Ala Lys Met He Val Glu Ala Lys 
340 345 350 



Pro Gly His Ala Leu Val Gly Phe Glu Met Ser Asn Asp Ser He Thr 
355 360 365 



Val Leu Lys Val Tyr Glu Ala»Lys Leu Lys Gin Asn Tyr Gin Val Asp 
370 375 380 
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Lys Asp Ser Leu Ser Glu Val lie Tyr Gly Asp Thr Asp Lys Leu Phe 
385 390 395 400 



Cys Pro Asp Gin Ser Glu Gin lie Tyr Tyr Thr Asn Asn He Val Phe 
405 410 415 



Pro Asn Glu Tyr Val He Thr Lys He Asp Phe Thr Lys Lys Met Lys 
420 425 430 



Thr Leu Arg Tyr Glu Val Thr Ala Asn Phe Tyr Asp Ser Ser Thr Gly 
435 440 445 



Glu He Asp Leu Asn Lys Lys Lys Val Glu Ser Ser Glu Ala Glu Tyr 
450 455 460 



Arg Thr Leu Ser Ala Asn Asp Asp Gly Val Tyr Met Pro Leu Gly Val 
465 470 475 480 



He Ser Glu Thr Phe Leu Thr Pro He Asn Gly Phe Gly Leu Gin Ala 
485 490 495 



Asp Glu Asn Ser Arg Leu He Thr Leu Thr Cys Lys Ser Tyr Leu Arg 
500 505 510 



Glu Leu Leu Leu Ala Thr Asp Leu Ser Asn Lys Glu Thr Lys Leu He 
515 520 525 



Val Pro Pro Ser Gly Phe He Ser Asn He Val Glu Asn Gly Ser He 
530 535 540 



Glu Glu Asp Asn Leu Glu Pro Trp Lys Ala Asn Asn Lys Asn Ala Tyr 
545 550 555 560 



Val Asp His Thr Gly Gly Val Asn Gly Thr Lys Ala Leu Tyr Val His 
565 570 575 



Lys Asp Gly Gly Phe Ser Gin Phe He Gly Asp Lys Leu Lys Pro Lys 
580 585 590 



Thr Glu Tyr Val He Gin Tyr Thr Val Lys Gly Lys Pro Ser He His 
595 600 605 



Leu Lys Asp Glu Asn Thr Gly Tyr He His Tyr Glu Asp Thr Asn Asn 
610 615 620 



Asn Leu Lys Asp Tyr Gin Thr He Thr Lys Arg Phe Thr Thr Gly Thr 
625 630 635 640 
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Asp Leu Lys Gly Val Tyr Leu He Leu Lys Ser' Gin Asn Gly Asp Glu 
645 650 655 

Ala Trp Gly Asp Lys Phe Thr He Leu Glu He Lys Pro Ala Glu Asp 
660 665 670 

Leu Leu Ser Pro Glu Leu He Asn Pro Asn Ser Trp He Thr Thr Pro 
675 680 685 

Gly Ala Ser He Ser Gly Asn Lys Leu Phe He Asn Leu Gly Thr Asn 
690 695 700 

Gly Thr Phe Arg Gin Ser Leu Ser Leu Asn Ser Tyr Ser Thr Tyr Ser 
705 710 715 720 

He Ser Phe Thr Ala Ser Gly Pro Phe Asn Val Thr Val Arg Asn Ser 
725 730 735 

Arg Glu Val Leu Phe Glu Arg Ser Asn Leu Met Ser Ser Thr Ser His 
740 745 750 

He Ser Gly Thr Phe Lys Thr Glu Ser Asn Asn Thr Gly Leu Tyr Val 
755 760 765 

Glu Leu Ser Arg Arg Ser Gly Gly Gly Gly His He Ser Phe Glu Asn 
770 775 780 

Val Ser He Lys 
785 



<210> 33 
<211> 2367 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Maize optimized Vip3C-12168 
<400> 33 

atgaacaaga acaacaccaa gctcaacgcc cgcgccctcc cgtccttcat cgactacttc 60 
aacggcatct acggcttcgc caccggcatc aaggacatca tgaacatgat cttcaagacc 120 
gacaccggcg gcaacctcac cctcgacgag atcctcaaga accagcagct cctcaacgag 180 
atcagcggca agctcgacgg cgtgaacggc tccctcaacg acctcatcgc ccagggcaac 240 
ctcaacaccg agctgtccaa ggagatcctc aagatcgcca acgagcagaa ccaggtgctc 300 
aacgacgtga acaacaagct cgacgccatc aacaccatgc tccacatcta cctcccgaag 360 
atcacctcca tgctctccga cgtgatgaag cagaactacg ccctctccct ccagatcgag 420 
tacctctcca agcagctcca ggagatcagc gacaagctcg acatcatcaa cgtgaacgtg 480 
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ctcatcaact ccaccctcac cgagatcacc ccggcctacc agcgcatcaa gtacgtgaac 540 



tccccggccg acatcctcga cgagctgacc gagctgaccg agctggccaa gtccgtgacc 660 

aagaacgacg tggacggctt cgagttctac ctcaacacct tccacgacgt gatggtgggc 720 

aacaacctct tcggccgctc cgccctcaag accgcctccg agctgatcgc caaggagaac 780 

gtgaagacct ccggctccga ggtgggcaac gtgtacaact tcctcatcgt gctcaccgcc 840 

ctgcaggcca aggccttcct caccctcacc acctgccgca agctcctcgg cctcgccgac 900 

atcgactaca cctccatcat gaacgagcac ctcaacaagg agaaggagga gttccgcgtg 960 

aacatcctcc cgaccctctc caacaccttc tccaacccga actacgccaa ggtgaagggc 1020 

tccgacgagg acgccaagat gatcgtggag gccaagccgg gccacgccct cgtgggcttc 1080 

gagatgtcca acgactccat caccgtgctc aaggtgtacg aggccaagct caagcagaac 1140 

taccaggtgg acaaggactc cctctccgag gtgatctacg gcgacaccga caagctcttc 1200 

tgcccggacc agtccgagca gatatactac accaacaaca tcgtgttccc gaacgagtac 1260 

gtgatcacca agatcgactt caccaagaag atgaagaccc tccgctacga ggtgaccgcc 1320 

aacttctacg actcctccac cggcgagatc gacctcaaca agaagaaggt ggagtcctcc 1380 

gaggccgagt accgcaccct ctccgccaac gacgacggcg tgtacatgcc gctcggcgtg 1440 

atctccgaaa ccttcctcac cccgatcaac ggcttcggcc tccaggccga cgagaactcc 1500 

cgcctcatca ccctcacctg caagtcctac ctccgcgagc tgctcctcgc caccgacctc 1560 

tccaacaagg agaccaagct catcgtgccg ccgtccggct tcatctccaa catcgtggag 1620 

aacggctcca tcgaggagga caacctcgag ccgtggaagg ccaacaacaa gaacgcctac 1680 

gtggaccaca ccggcggcgt gaacggcacc aaggccctct acgtgcacaa ggacggcggc 1740 

ttctcccagt tcatcggcga caagctcaag ccgaagaccg agtacgtgat ccagtacacc 1800 

gtgaagggca agccgtccat ccacctcaag gacgagaaca ccggctacat ccactacgag 1860 

gacaccaaca acaacctcaa ggactaccag accatcacca agcgcttcac caccggcacc 1920 

gacctcaagg gcgtgtacct catcctcaag tcccagaacg gcgacgaggc ctggggcgac 1980 

aagttcacca tccttgagat caagccggcc gaggacctcc tctccccgga gctgatcaac 2040 

ccgaactcct ggatcaccac cccgggcgcc tccatctccg gcaacaagct cttcatcaac 2100 

ctcggcacca acggcacctt ccgccagtcc ctctccctca actcctactc cacctactcc 2160 

atctccttca ccgcctccgg cccgttcaac gtgaccgtgc gcaactcccg cgaggtgctc 2220 

ttcgagcgct ccaacctcat gtcctccacc tcccacatct ccggcacctt caagaccgag 2280 

tccaacaaca ccggcctcta cgtggagctg tcccgccgct ccggcggcgg cggccacatc 2340 

tccttcgaga acgtgtccat caagtag 2367 



gagaagttcg aggagctgac cttcgccacc gagaccaccc tcaaggtgaa gaaggactcc 



600 
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